1 TCGCGCCTTT CACCGGCACC TTGCGTCGGT CGCGCCGCGG GGCCTGCTCC
51 TGCCGCGCGC ACCCCCGGGG CTTCGGCTCC GGCACGGGTC GCGCCCAGCT
101 TTCCTGCACC TGAGGCCGCC GGCCAGCCGC CGCCATGGGT GCCTACCTCT
151 CCCAGCCCAA CACGGTGAAG TGCTCCGGGG ACGGGGTCGG CGCCCCGCGC
201 CTGCCGCTGC CCTACGGCTT CTCCGCCATG CAAGGCTGGC GCGTCTCCAT
251 GGAGGATGCT CACAACTGTA TTCCTGAGCT GGACAGTGAG ACAGCCATGT
301 TTTCTGTCTA CGATGGACAT GGAGGGGAGG AAGTTGCCTT GTACTGTGCC
351 AAATATCTTC CTGATATCAT CAAAGATCAG AAGGCCTACA AGGAAGGCAA
401 GCTACAGAAG GCTTTAGAAG ATGCCTTCTT GGCTATTGAC GCCAAATTGA
451 CCACTGAAGA AGTCATTAAA GAGCTGGCAC AGATTGCAGG GCGACCCACT
501 GAGGATGAAG ATGAAAAAGA AAAAGTAGCT GATGAAGATG ATGGGGACCA
551 CTTCTATAAG AGAAACAAGA ACCTGCCACC TGAGGAACAG ATGATTTCAG
601 CCCTTCCTGA CATCAAGGTG CTGACTCTCA CTGACGACCA TGAATTCATG
651 GTCATTGCCT GTGATGGCAT CTGGAATGTG ATGAGCAGCC AGGAAGTTGT
701 AGATTTCATT CAATCAAAGA TCAGCCAGCG TGATGAAAAT GGGGAGCTTC
751 GGTTATTGTC ATCCATTGTG GAAGAGCTGC TGGATCAGTG CCTGGCACCA
801 GACACTTCTG GGGATGGTAC AGGGTGTGAC AACATGACCT GCATCATCAT
851 TTGCTTCAAG CCCCGAAACA CAGCAGAGCT CCAGCCAGAG AGTGGCAAGC
901 GAAAACTAGA GGAGGTGCTC TCTACTGAGG GGGCTGAAGA AAATGGCAAC
951 AGCGACAAGA AGAAGAAGGC CAAGCGAGAC TAGCAGTCAT CCAGACCCCT
1001 GCCCACCTAG ACTGTTTTCT GAGCCCTCCG GACCTGAGAC TGAGTTTTGT
1051 CTTTTTCCTT TAGCCTTAGC AGTGGGTATG AGGTGTGCAG GGGGAGCTGG
1101 GTGGCTTCAC TCCGCCCATT CCAAAGAGGG CTCTCCCTCC ACACTGCAGC
1151 CGGGAGCCTC TGCTGTCCTT CCCAGCCGCC TCTGCTCCTC GGGCTCATCA
1201 CCGGTTCTGT GCCTGTGCTC TGTTGTGTTG GAGGGAAGGA CTGGCGGTTC
1251 TGGTTTTTAC TCTGTGAACT TTATTTAAGG ACATTCTTTT TTATTGGCGG
1301 CTCCATGGCC CTCGGCCGCT TGCACCCGCT CTCTGTTGTA CACTTTCAAT
1351 CAACACTTTT TCAGACTAAA GGCCAAAACC TAATCGTTAA AAAAAAAAAA
1401 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAA (SEQ ID NO:1)






FEATURES: 

5' UTR: 1-134
Start: 135 
Stop: 981 
3' UTR: 984 
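
The coordinates above can be checked against the length of the encoded protein. The following is a minimal sketch, not part of the original listing. It assumes that "Start" is the first base of the ATG codon, that "Stop" is the first base of the stop codon, and that the protein of SEQ ID NO: 2 has 282 residues (the last residue number in the protein listing). The function name `orf_summary` is hypothetical.

```python
# Feature coordinates copied from the FEATURES block (1-based, SEQ ID NO:1).
START = 135           # assumed: first base of the ATG start codon
STOP = 981            # assumed: first base of the stop codon
THREE_UTR = 984       # first base of the 3' UTR
PROTEIN_LENGTH = 282  # residues listed for SEQ ID NO: 2


def orf_summary(start, stop):
    """Return (whole codons, leftover bases) for the coding bases start..stop-1."""
    return divmod(stop - start, 3)


codons, leftover = orf_summary(START, STOP)
five_utr_length = START - 1   # bases 1..134
stop_codon_end = STOP + 2     # a stop codon spans three bases

print(f"coding codons: {codons}, leftover bases: {leftover}")
print(f"5' UTR length: {five_utr_length}, stop codon ends at: {stop_codon_end}")
```

Under these assumptions the open reading frame is in frame (no leftover bases), its codon count equals the protein length, and the stop codon ends immediately before the 3' UTR begins.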

Homologous proteins:

Top 10 BLAST Hits

                                                                 Score  E
CRA|18000005093929 /dataset=nraa /length=546 /altid=gi|4505999    301   6e-81
CRA|18000005157254 /dataset=nraa /length=542 /altid=gi|6679793    284   1e-75
CRA|18000005062172 /dataset=nraa /length=543 /altid=gi|13122591   283   1e-75
CRA|1000682325310 /dataset=nraa /length=356 /altid=gi|7508306     161   1e-38
CRA|18000004879328 /dataset=nraa /length=348 /altid=gi|11352582   161   1e-38
CRA|18000004973158 /dataset=nraa /length=370 /altid=gi|11171963   154   9e-37
CRA|18000005006718 /dataset=nraa /length=414 /altid=gi|11171964   154   1e-36
CRA|18000004973159 /dataset=nraa /length=414 /altid=gi|11076914   152   3e-36
CRA|89000000194666 /dataset=nraa /length=352 /altid=gi|17291977   151   8e-36
CRA|18000004972853 /dataset=nraa /length=464 /altid=gi|6320934    145   4e-34
CRA|89000000203611 /dataset=nraa /length=662 /altid=gi|17302240   141   9e-33
CRA|18000004915558 /dataset=nraa /length=468 /altid=gi|16319415   140   2e-32
CRA|18000005051471 /dataset=nraa /length=468 /altid=gi|11622933   140   2e-32

BLAST to dbEST:

                                                                          Score  E
gb|BF341923|BF341923 602016449F1 NCI_CGAP_Brn64 Homo sapiens cD..          878   0.0
gb|AW370275|AW370275 RC1-BT0254-131199-013-f12 BT0254 Homo sapi..          844   0.0
gb|BE305043|BE305043 601186706F1 NIH_MGC_15 Homo sapiens cDNA c..          835   0.0
gb|BE249868|BE249868 600942905F1 NIH_MGC_15 Homo sapiens cDNA c..          835   0.0
gb|AI417892|AI417892 tg55b01.x1 NCI_CGAP_Pr28 Homo sapiens cDNA..          823   0.0
gb|BE896031|BE896031 601438825F1 NIH_MGC_72 Homo sapiens cDNA c..          815   0.0
gb|AI686098|AI686098 tt92a01.x1 NCI_CGAP_Pr28 Homo sapiens cDNA..          815   0.0
gb|AU143285|AU143285 AU143285 Y79AA1 Homo sapiens cDNA clone Y7..          813   0.0
gb|AU142966|AU142966 AU142966 Y79AA1 Homo sapiens cDNA clone Y7..          813   0.0
gb|BE903477|BE903477 601676724F1 NIH_MGC_21 Homo sapiens cDNA c..          813   0.0
gb|BE891121|BE891121 601432236F1 NIH_MGC_72 Homo sapiens cDNA c..          813   0.0
gb|BE886682|BE886682 601507935F1 NIH_MGC_71 Homo sapiens cDNA c..          813   0.0
gb|BE797128|BE797128 601587118F1 NIH_MGC_7 Homo sapiens cDNA cl..          813   0.0
gb|BE796239|BE796239 601591808F1 NIH_MGC_7 Homo sapiens cDNA cl..          813   0.0
gb|BE793727|BE793727 601588924F1 NIH_MGC_7 Homo sapiens cDNA cl..          813   0.0
gb|BE781616|BE781616 601467472F1 NIH_MGC_67 Homo sapiens cDNA c..          813   0.0

EXPRESSION INFORMATION FOR MODULATORY USE:

library source:

Expression information from BLAST dbEST hits:
gb|BF341923|BF341923 Brain
gb|AW370275|AW370275 Breast
gb|BE305043|BE305043 Colon adenocarcinoma
gb|BE249868|BE249868 Colon adenocarcinoma
gb|AI417892|AI417892 Prostate
gb|BE896031|BE896031 Skin melanotic melanoma
gb|AI686098|AI686098 Prostate
gb|AU143285|AU143285 Eye-retinoblastoma
gb|BE903477|BE903477 Placenta Choriocarcinoma
gb|BE891121|BE891121 Skin-melanotic melanoma
gb|BE886682|BE886682 Uterus-leiomyosarcoma
gb|BE797128|BE797128 Lung small cell carcinoma
gb|BE796239|BE796239 Lung small cell carcinoma
gb|BE793727|BE793727 Lung small cell carcinoma
gb|BE781616|BE781616 Eye-retinoblastoma



Expression information from PCR-based tissue screening panels:
Human leukocyte
1 MGAYLSQPNT VKCSGDGVGA PRLPLPYGFS AMQGWRVSME DAHNCIPELD

51 SETAMFSVYD GHGGEEVALY CAKYLPDIIK DQKAYKEGKL QKALEDAFLA 

101 IDAKLTTEEV IKELAQIAGR PTEDEDEKEK VADEDDGDHF YKRNKNLPPE 

151 EQMISALPDI KVLTLTDDHE FMVIACDGIW NVMSSQEVVD FIQSKISQRD

201 ENGELRLLSS IVEELLDQCL APDTSGDGTG CDNMTCIIIC FKPRNTAELQ 

251 PESGKRKLEE VLSTEGAEEN GNSDKKKKAK RD (SEQ ID NO: 2) 

FEATURES:

Functional domains and key regions:

[1] PDOC00001 PS00001 ASN_GLYCOSYLATION
N-glycosylation site

233-236 NMTC

[2] PDOC00005 PS00005 PKC_PHOSPHO_SITE
Protein kinase C phosphorylation site

Number of matches: 4

1 10-12 TVK
2 197-199 SQR
3 253-255 SGK
4 273-275 SDK

[3] PDOC00006 PS00006 CK2_PHOSPHO_SITE
Casein kinase II phosphorylation site

Number of matches: 10

1 38-41 SMED
2 57-60 SVYD
3 106-109 TTEE
4 122-125 TEDE
5 164-167 TLTD
6 184-187 SSQE
7 197-200 SQRD
8 210-213 SIVE
9 224-227 TSGD
10 229-232 TGCD

[4] PDOC00008 PS00008 MYRISTYL
N-myristoylation site

Number of matches: 2

1 2-7 GAYLSQ
2 266-271 GAEENG

[5] PDOC00009 PS00009 AMIDATION
Amidation site

253-256 SGKR

[6] PDOC00792 PS01032 PP2C
Protein phosphatase 2C signature

55-63 MFSVYDGHG
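
The site lists above can be reproduced with a simple pattern scan. The sketch below is illustrative only and is not the tool used to generate the figure. It assumes the Prosite patterns `[ST]-x-[RK]` (PS00005) and `[ST]-x(2)-[DE]` (PS00006), rewritten here as Python regular expressions. The protein string is transcribed from SEQ ID NO: 2, and the names `PROTEIN`, `PATTERNS` and `scan_motif` are hypothetical.

```python
import re

# SEQ ID NO: 2, transcribed from the protein listing (282 residues).
PROTEIN = (
    "MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEDAHNCIPELD"
    "SETAMFSVYDGHGGEEVALYCAKYLPDIIKDQKAYKEGKLQKALEDAFLA"
    "IDAKLTTEEVIKELAQIAGRPTEDEDEKEKVADEDDGDHFYKRNKNLPPE"
    "EQMISALPDIKVLTLTDDHEFMVIACDGIWNVMSSQEVVDFIQSKISQRD"
    "ENGELRLLSSIVEELLDQCLAPDTSGDGTGCDNMTCIIICFKPRNTAELQ"
    "PESGKRKLEEVLSTEGAEENGNSDKKKKAKRD"
)

# Assumed regex translations of two Prosite patterns.
PATTERNS = {
    "PKC_PHOSPHO_SITE": r"[ST].[RK]",     # [ST]-x-[RK]
    "CK2_PHOSPHO_SITE": r"[ST].{2}[DE]",  # [ST]-x(2)-[DE]
}


def scan_motif(sequence, pattern):
    """Return (start, end, residues) for every match, 1-based and inclusive.

    The pattern is wrapped in a lookahead so that overlapping sites are found.
    """
    hits = []
    for match in re.finditer(f"(?=({pattern}))", sequence):
        site = match.group(1)
        start = match.start() + 1
        hits.append((start, start + len(site) - 1, site))
    return hits


for name, pattern in PATTERNS.items():
    hits = scan_motif(PROTEIN, pattern)
    print(name, "matches:", len(hits))
    for start, end, site in hits:
        print(f"  {start}-{end} {site}")
```

The lookahead wrapper matters because candidate sites can overlap; a plain `re.finditer` would skip any site that begins inside a previous match.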




BLAST Alignment to Top Hit:

>CRA|18000005093929 /dataset=nraa /length=546 /altid=gi|4505999
/def=ref|NP_002698.1| protein phosphatase 1G (formerly
2C), magnesium-dependent, gamma isoform; protein
phosphatase 1G (formerly 2C) , ; protein phosphatase 2,
catalytic subunit, gamma isoform [Homo sapiens]
/org=Homo sapiens /taxon=9606
Length = 546

Score = 301 bits (763), Expect = 6e-81

Identities = 146/146 (100%), Positives = 146/146 (100%)

Query: 137 GDHFYKRNKNLPPEEQMISALPDIKVLTLTDDHEFMVIACDGIWNVMSSQEVVDFIQSKI 196
           GDHFYKRNKNLPPEEQMISALPDIKVLTLTDDHEFMVIACDGIWNVMSSQEVVDFIQSKI
Sbjct: 401 GDHFYKRNKNLPPEEQMISALPDIKVLTLTDDHEFMVIACDGIWNVMSSQEVVDFIQSKI 460

Query: 197 SQRDENGELRLLSSIVEELLDQCLAPDTSGDGTGCDNMTCIIICFKPRNTAELQPESGKR 256
           SQRDENGELRLLSSIVEELLDQCLAPDTSGDGTGCDNMTCIIICFKPRNTAELQPESGKR
Sbjct: 461 SQRDENGELRLLSSIVEELLDQCLAPDTSGDGTGCDNMTCIIICFKPRNTAELQPESGKR 520

Query: 257 KLEEVLSTEGAEENGNSDKKKKAKRD 282
           KLEEVLSTEGAEENGNSDKKKKAKRD
Sbjct: 521 KLEEVLSTEGAEENGNSDKKKKAKRD 546 (SEQ ID NO:4)



Score = 284 bits (718), Expect = 1e-75

Identities = 137/139 (98%), Positives = 138/139 (98%)

Query: 1   MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEDAHNCIPELDSETAMFSVYD 60
           MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEDAHNCIPELDSETAMFSVYD
Sbjct: 1   MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEDAHNCIPELDSETAMFSVYD 60

Query: 61  GHGGEEVALYCAKYLPDIIKDQKAYKEGKLQKALEDAFLAIDAKLTTEEVIKELAQIAGR 120
           GHGGEEVALYCAKYLPDIIKDQKAYKEGKLQKALEDAFLAIDAKLTTEEVIKELAQIAGR
Sbjct: 61  GHGGEEVALYCAKYLPDIIKDQKAYKEGKLQKALEDAFLAIDAKLTTEEVIKELAQIAGR 120

Query: 121 PTEDEDEKEKVADEDDGDH 139
           PTEDEDEKEKVADEDD D+
Sbjct: 121 PTEDEDEKEKVADEDDVDN 139 (SEQ ID NO: 5)
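
The "Identities" figure for the segment pair above can be recomputed directly from the aligned rows. The following is a small sketch for checking such counts by hand, not a reimplementation of BLAST. The aligned strings are copied from the Query and Sbjct rows for residues 1-139, and `count_identities` is a hypothetical helper name.

```python
def count_identities(query, subject):
    """Count aligned columns where both rows carry the same residue (gaps excluded)."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")


# (query row, subject row) for each printed block of the 1-139 alignment.
BLOCKS = [
    ("MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEDAHNCIPELDSETAMFSVYD",
     "MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEDAHNCIPELDSETAMFSVYD"),
    ("GHGGEEVALYCAKYLPDIIKDQKAYKEGKLQKALEDAFLAIDAKLTTEEVIKELAQIAGR",
     "GHGGEEVALYCAKYLPDIIKDQKAYKEGKLQKALEDAFLAIDAKLTTEEVIKELAQIAGR"),
    ("PTEDEDEKEKVADEDDGDH",
     "PTEDEDEKEKVADEDDVDN"),
]

identical = sum(count_identities(q, s) for q, s in BLOCKS)
aligned = sum(len(q) for q, _ in BLOCKS)
print(f"Identities = {identical}/{aligned}")
```

The only differing columns are the last G/V and H/N pairs, which is consistent with the 137/139 reported for this segment pair.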



>CRA|18000005157254 /dataset=nraa /length=542 /altid=gi|6679793
/def=ref|NP_032040.1| fibroblast growth factor inducible
13 [Mus musculus] /org=Mus musculus /taxon=10090
Length = 542

Score = 284 bits (718), Expect = 1e-75

Identities = 139/146 (95%), Positives = 141/146 (96%), Gaps = 1/146 (0%)

Query: 137 GDHFYKRNKNLPPEEQMISALPDIKVLTLTDDHEFMVIACDGIWNVMSSQEVVDFIQSKI 196
           GDHFYKRNKNLPP+EQMISALPDIKVLTLTDDHEFMVIACDGIWNVMSSQEVVDFIQSKI
Sbjct: 398 GDHFYKRNKNLPPQEQMISALPDIKVLTLTDDHEFMVIACDGIWNVMSSQEVVDFIQSKI 457

Query: 197 SQRDENGELRLLSSIVEELLDQCLAPDTSGDGTGCDNMTCIIICFKPRNTAELQPESGKR 256
           SQRDENGELRLLSSIVEELLDQCLAPDTSGDGTGCDNMTCIIICFKPRNT ELQ ESGKR
Sbjct: 458 SQRDENGELRLLSSIVEELLDQCLAPDTSGDGTGCDNMTCIIICFKPRNTVELQAESGKR 517



FIGURE 2, page 2 of 3 



Query: 257 KLEEVLSTEGAEENGNSDKKKKAKRD 282
           KLEE LSTEGAE+ GNSD KKKAKRD
Sbjct: 518 KLEEALSTEGAEDTGNSD-KKKAKRD 542 (SEQ ID NO: 6)



Score = 279 bits (706), Expect = 3e-74

Identities = 133/139 (95%), Positives = 138/139 (98%)

Query: 1   MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEDAHNCIPELDSETAMFSVYD 60
           MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEDAHNCIPELD+ETAMFSVYD
Sbjct: 1   MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEDAHNCIPELDNETAMFSVYD 60

Query: 61  GHGGEEVALYCAKYLPDIIKDQKAYKEGKLQKALEDAFLAIDAKLTTEEVIKELAQIAGR 120
           GHGGEEVALYCAKYLPDIIKDQKAYKEGKLQKAL+DAFLAIDAKLTTEEVIKELAQIAGR
Sbjct: 61  GHGGEEVALYCAKYLPDIIKDQKAYKEGKLQKALQDAFLAIDAKLTTEEVIKELAQIAGR 120

Query: 121 PTEDEDEKEKVADEDDGDH 139
           PTEDED+K+KVADEDD D+
Sbjct: 121 PTEDEDDKDKVADEDDVDN 139 (SEQ ID NO: 7)





Hmmer search results (Pfam):

Model     Description               Score   E-value   N
PF00481   Protein phosphatase 2C    176.2   4.2e-49   2
PF01722   BolA-like protein           3.7   9.5       1

Parsed for domains:

Model     Domain  seq-f  seq-t     hmm-f  hmm-t     score   E-value
PF00481   1/2        25    102 ..      1     95 [.   74.1   4.6e-20
PF01722   1/1        99    110 ..     76     87 .]    3.7   9.5
PF00481   2/2       137    228 ..    197    301 .]  102.2   4.5e-28







1 AAAGAATCTT tTTTTTTTTT TTGAGACGGA GTTGCTCTGT CACCCAGGGT 

51 GGAGTGCAGT GGCGCCATCT TGGTTCACTG CAACCTCCGC CTCTGGGGTT 

101 CAAGCTATTC GCCTGCCTTA GCCTCCCAAG TAGCTGGGAT TACAGGAGCG 

151 CACCACTACG CCTGGCTAAT TTTTGTATTT TTAGTAGAGA CGGGTTTCAC 

201 ATGTTGGCCA GGCTGGTCTC GAACTTCTGG CCTCAAGTGA TCCACCACCC 

251 CCCCTTGGCC TCCCAAAGTG CTGGGATTAC AAGTGTGAGC CACTGTGCCC 

301 GGCTGAAAAG AATCAATTTT GTCATAGTTT GGAGAATTTC TCCTTTTCTC 

351 TCCATCCCTT GAATGCAATT TATTACCAAA TCTGTCTTAT TTGTTATTGT 

401 CTAATTTGTC CTTTCATCTG GATTCCCATT GCCACCCTGC GTGGTACCAC 

451 CTTACTCCCA GCTCTTCTCA TCTCCTGCTT AGAGTAAGAG CTCTCTAACT

501 AGTAGCAGTG CCCCAGGCCA GGCGCGGTGG CTCACGCCTG TAATCCCAGC 

551 ACTTTGGGAG GCTGAGGCGG GTAGATCACG AGGTCAGGAG TTCGAGACCA 

601 GCCTGGCCAA CATGGTGAAA CCCCGTCTCT ACTAAAAATA CAAAAATTAG 

651 CCAGGCGTGG AATCCTAGCT ACTCGGGAGG CTGAGGCAGG AGAATCGCTT 

701 GAACCTGGGA GGCGGAGGTT GCAGTGAGCC AAGATAGCGC CACTGCACTG

751 CAGCCTGGGC AACAAGAGCG AAACTCTGTC TTAAAAAAAA AACAATAGTA 

801 GGCGGGGTGC GGTGGCTCAC GCCTGTAATC CCAACACTTT GGGAGGCCGA 

851 GGCGGGCGGA TCACGAGGTC AGGAGATGGA GACCATCCTG GCTAACACGG 

901 TGAAACCTCG TCTCTACTAA AAATACAAAA AATTAGCCAG GCGTGGTGGC 

951 GGGCGCCTGT AGTCCCAGCT ACTCGGGAGG CTGAGGCAGG AGAATGGCGT 

1001 GAACCCGGGA GGCGGAGGTT GTAGCCTGGG CGACAGAGCG AGACTCTGTC 

1051 TCAAAAACAA ACAAAAAACA ACAACACAAC AGTGCCCCAG ACTCTCTCCC 

1101 TCCAATGTAC ACTGCATACA AAGACTAGAC AAACGATGCC AAAGGTTACA 

1151 CCAGGACGAC AATGAAGTCC AAGTCACTCA TCTTGGCGCT TCTTTTTATC 

1201 AAGCTAGCTA GTTAATATTA CCACTTACAA TCATTCTCCG AGTCCAGCCA 

1251 GCTTCCTCAG AAATCCTCCC CAAATGCAGT TCACATTCTT CCCTCACTCC 

1301 TCCAAACCTT ATAACATTAT TCCCTTTTCC TGTGGCATAA TGCAATCCAG 

1351 AGGCATCCTC TCTGAGAGCC TACATCCTTT CAATCCTCCA AGAAACAGCT 

1401 CCTCCTCCCA TCCTTGAGCT TTTCCCACCC AGAATAGGCT GTACCAAACA 

1451 TTTCTACATT GTATTAACTC GAATTATATC ACAGATCGAG GTCTTCGCTT 

1501 TCAGAAAAGA CTCACATTCT TCTCATATAG GCCTCGCATG GCCTGGCCAA 

1551 GTACTTCCTA AGCTCGGAAC AAATACTGGT CAACTTTAAT TGAACCAAAT 

1601 CGGGCGGGGT TTGCGGAGTC TGAGAGTGCA ACGTTGGGGA GAGGGGGATG 

1651 AAAACACTGG AGGACGGGCT GAAAGCGTCG AGTCCGACAC AAAAGAGGCG 

1701 TCAGACAAAA CGCCAAGAGG CTGGGGACTG GGAACGAAGG AAGAGGTTCT 

1751 GCCAGAGGCG ACCTGCCACC TGCGCGAGGA AGCGGAGTAG GACGGCGGCC 

1801 GTTGGTGGGC GTGGTCGCGC TAGTCTCGCG GGAGCGGCCG TTGGGCGGGC 

1851 CGTTGTCCCC TGCGGGCGGG GCGAGTTGCT AAGGAAATGA CTGCCCGCAG 

1901 CGCCTGGCCC CGCCGCGCAG GCCGGGCGGG GTCTGGAGCG GCGCCGTTTC 

1951 CGCTTCCGCT CCCTCACAGC TCCCGTCCCG TTACCGCCTC CTGGCCGGCC 

2001 TCGCGCCTTT CACCGGCACC TTGCGTCGGT CGCGCCGCGG GGCCTGCTCC 

2051 TGCCGCGCGC ACCCCCGGGG CTTCGGCTCC GGCACGGGTC GCGCCCAGCT 

2101 TTCCTGCACC TGAGCCGCCG GCCAGCCGCC GCCATGGGTG CCTACCTCTC 

2151 CCAGCCCAAC ACGGTGAAGT GCTCCGGGGA CGGGGTCGGC GCCCCGCGCC 

2201 TGCCGCTGCC CTACGGCTTC TCCGCCATGC AAGGCTGGCG CGTCTCCATG 

2251 GAGGTGAGGA GGCAGGGGCC CATAGGCTGG CCGCTGCGGG GCGGGAATCT

2301 GACGGAGAAA GAGAGCGGGG GATGGGGTCC TCCCCTGGGA AGGGTCCCAA 

2351 TTGGGAGCCT GCGGCCGCAG CGGCCGTTTG CGGGGCGACA GAGACCGCGG 

2401 GGTCAGGGCC GAGAGGGAGC TCCCGATGCT TGGGGACCGT GTGCCGGTGG 

2451 CCAGTGGCGG CGAGGGCTCG GCCATGTGGG AAGAGGCACC TTCCGCCCAC 

2501 TGACCGCCCT CTCCCCGAGC TTTGGCGCCA TCCTCTCGTG CCAACCTAGC 

2551 CCTCCAGGCT CATCAACGGT GTGAGGTTTA GTGTGGGAGT AAAGACGCAA 

2601 ATAGGGGCTA TTTATTCATT TGTTTTTCAA GGGAGAGGTG AATAGATGTG 

2651 AATAACTTTT TAAATTTTAA TATTTAAAAT ATCTGATGTG GGAAGCCTCT 

2701 TTTGGCTAGG AGTTTGACAG TGAAAGGAAC CCCGGGCAGA GTCTGTTTCA 

2751 CATTTTGGTT GCCTGGCCTT GGGCTCTTGC ATGTTAATTT CAGAGGCTGG 

2801 ACCCGACCTC CAGGAGTTGT CACTCATTTG CACTCTTTTC AGGGCCTTTT 

2851 ACTAACTTCG GAAAACTTGA ATTATGTCAG TCCCTAGGTT TTCCTTTTTA

2901 TTATGCTTCG TTTTCTTTCT CTCTTAGGAT TTCTCTAAAA CTTAATCAGT 

2951 AATTCTCATT TGCCTGTAAT TGTAGTTTAT AGTTTCTTTG ATTGTAAGTC 

3001 ATTTCATCAA TTTTTTTCAT CACAACCTAC CTACAAAGGG CTTTTCTAGA 

3051 AAATTTTACT CTGGACAAAA GGGGAAAAGA AAAATATTGG GGGGAAAGTA 

3101 GTAGTATTAG GTAAAACTTG ATGTGAAACT ACAAAAGAGA AGAGGGAAAA




3151 CTGCGGTAGG GAGGAAAGGG AGGAAGACGG GTTAACCGTG GCTTTGTGAA

3201 GAGCATTCTG AAGTCTAGGC AAAAGGGCCA GGGAAATACT CTGTCTGGTA 

3251 TTGAGGGTTT CTCCACCTAC CGGGTGGGCT TCAGGTAACA GCGAAATACT 

3301 GTCTCCCTTG GGAATTGTTT CAGATCCCTC GCTCCTCCTG TGGTTAGCTC 

3351 TGGAATGCCA GTATGAACTT CAATGTTTTG TTTTCCGATT CAAATTTTAT 

3401 ATTCATAACT GACCTTAATA ACAATTTTAC AATTAGGTAT AAAATTTCAG

3451 GATCCTAGTG TATCCTATAG TTCATCTCAT CTGCTTTGGC TCCCTTTTTT

3501 TTTTTTTGAG ACAGAATTTC GCTCTTGTTG CCCAGGCTGG AATGCAATGG 

3551 CGCGATCTCG GCTCACCGCA ACCTCCGCCT CCCAGGTTGG AGCAATTCTC 

3601 CTGCCTCAGC CTCCATAGTA GTTGGGATCA CAGGCATGTG CCACCACGCG 

3651 TGGCTTATTT TGTATTTTTA GTAGAGACAG GGTTTCTTCA TGTGGGTCAG 

3701 GCTGGTGTTG AACTCCTGAC TTAGGTGATC TGCCTGCCTC GGTCTCCCAA

3751 AGTGCTACCA CCACNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

3801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

3851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

3901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

3951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

4351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

4401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

4451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

4501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

4551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

4601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

4651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

4701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

5451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

6001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

6051 NNNNNNNNNN NNNNNNNNNN NNNNNTGGGA TTACAGACAT GAGCCACTGC 

6101 GCCCAGCCTT ATTTAGAAAT TTCTTTAGTG AAAGATGATA AATTTTCAGT 

6151 TTTTCATTAT CTGAACATGT TTTTATCTAG CCTTTGTTCT GAAAAGATGC 

6201 TTGGACTCAG TACCCAGTTC TAGATTGACA GTTAATTTTT CTTAATTTGT 

6251 AAATGTTGTT TCATTGATTG ACTTCCATTG TTGTTCGGAA AAATTTATCA 
TCAGCCATTT CTGACTTTTG ATCTGTGTTT TCTCTTTGGT TTCTCTTTTT
TTTTTTCTTT tTTTTTTTTT GAGACGGAGT GTCGCTCTGT TGCCCAGGCT 
GGAGTGCAGT GGCATGATCT TGGCTCACTG CAACCTCTGC CTCCTGAGTT 
CAAGCGATTC TCCTGCCTCA GCTTCCCGAG TAGCTGGGAT TACAGGCGCC 
TGCCACCATG CCCGGCTAAT TTTTTGTATT TTCAATAGAG ACAGGGTTTC 
ACTATGTTGG CCGGGTTGGT CTCCAACTCC TGACCTCTTA ATCCGCCCGC 
CTCGGCCTCC CAAAGTGCTG GCATTACAGG CGTGAGCCAC catgcctggc 

ccatcagttg atgtagtctt aaggggacaa gagtacattt aatatttggt 
tgtgaggttc tctggaagtg acaaaactgc tttctatgga gagttaggaa 

,p,j,q,rpip,j,rprrq,rj, TTTTGAAACG GAGTCTCGCA TTGTCACCCG GGCTGGAGTG 
TAGTGGCTTG ATCTCGGCTC ACTGAAACTT CCGCCTCTTG GGTTCAAGTG 
ATTCTCCTGC CTCAGCCTTC CAAATAGCTG GGATTACAGG AGTCTGCCAC 
CAGGCCAGCT AATTTTTTTT TGTATTTTTA GTGGAGACAG gatttcacta 

tgttggccag gctggtctca agactcctga cgttgtgatc cacctgcctc 
ggcctcccaa agtgctggga ttacagatgt gagctaccgt gcccggccag 
gaattttttg tgctataaat catattttcc ttttattaaa ggcagtgtca 

ATATCTATAG TATAATTTTG AGGAGGCTGG CTATTTATTG CTGTGTAGAA 
GCTGGCTTAT TAGTGGTCAA GGGGTCATCT AGAATTGACT ATAAAGATAG 
TATTGAGCAG AAAATTCTTA AAATACCTGC atattagttt cagtcattaa 
attaatggaa aaaatataaa aagaaatatc acaagtatgc tatggggttc 
taccttaggg cattgaaggt tgaaaaacat ttattttctt atcttcagat 
tagcatctca tatcaatagc caatagctta aagcgctttt tacttactaa 

ACCAGGTCAG AATTTCTCTC TCTCTCTCTT TTTTTTTTTT AGACAGAGTC 
TCTGTCACCC AGTCTAGAGT GCAGTGGTGT GATCTCGGCT CACTGGCCTT 
TCTGTTTAAG TGATTCTCCG GCCTCAGCCT CCTGAGTAGC TGGGATTACA 
GGCATGTGCC ACGACACTTG GCTTATTTTT TGTATTTTTA GTAGAGATGG 
GGTTTCTCTG TGTTGGTCAG GCAGGTCTTG AACTCCTGAC CTCAGGTGAT 
CTGCCCGCCT CCGCCTCCCA AAGTGCTGGG ATTACAGACG TGAGCTACTG 
CGCCTGGCCA GAATTTCTTT GTCTAGAATG TGGTTAGCAA CTTTTATAAA 
AACGCATTAT TTGCATTTGA TTAGCATGCA GTACCCATTC ACAGTTCAAA 
GCTAGTATAG AATTATATCA CATGTATGCC CATGAGCATG GAGAAACTAT 
TTTCTTTTTA TTTTTTTAAG TTGGAGTTTT GCTCTTGTTG CCCAGGCTGG 
AGTGCAATGG TGCCATCTCG GCTCACTGCA GCTTCTGCCT CCTGGGTTCT 
AGCAATTTGC CTACCCCAGC CTCCCAAGTA GCTGGGATTA CAGGCACTCG 
CCACCATGCC CAGCTAACTT TTTTGTATTT TTAGTAGAGA AGGGGTTTTC 
CCATGCTGGC GAGTCTGGTC TTGAACTTCT GGCCTCAAGT GATCTGCCCG 
CCTCAGCCTC CCAAAGTGCT GGAATTACAG GCATGAGCCA CTGTGCCCGG 
CCTTTTTATT TTTTAAATTA TTTATGTATT TATTTTGAGA CAGGATCTCA 
CTCTTGCCCA TGCTTGAGTG GTAAGGAGTA TGGGATTTCT TGTGCCGGTT 
CCTTCACATC CTCACTACAC TTATCTGCCT TTACAGTGGC TCGATCATAG 
TTCACTGCAT AGCCTTCTGG GCTCAAGGGG TCTTCCAGCC TCAGCCTAAT 
ATAGGCACAT GCCACCATGA CTGGCTAATT TTTTTTTTTT AAGTTTTTTT 
TTGTAGAGAT AGGGCCTTGC AGTGTTGCCC AGGCTGAGGA ATTTTATTTA 
TGTTTATTTT ATTTATTTAT TTATTTATTT ATTTATTTAT TTATTTATTT 
ATTGAGATGG AGTCTTACAC TGTCACCCAG GCTGGAGTGC AGTTGCGCGG 
TCTCAGCTAC TGCAAGCTCC GCCTCCCGGG TTCATGCCAT TCTCCTGCCT 
CAGCCTCCCG CGTAGCTGGA CTACAGGCGC CCGCCACCAC GCCTGGCTAA 
TTTTTTGTGT GTGTTTTTAG TAGAGGCGGG GTTTCACCAT GTTAGCCAGG 
ATGGTCTCGA TTCCTGACCT CGTGATCCAC CCACATCGGC CTCCCAAAGT 
GCTGGGATTA CAGGTGTGAG CCACCATGCC TGGTCTAGAA ATTATTTTAT 
ATTTTATACC ATTGCCTTAT AAGTTCTCAA GCAACTGGAA AATACAATCA 
GAACGTATTC CTCAAGATTT CAAGGATATT TTACACAAAG TTCTATTGTC 
TGATTCCTTA GCAGTTGTTA CTACTGTTTC CCTAACCTCT AATCTTCTAT 
TGGGTTATTA GTCTTAGAAT TGAATTTTGA GAGGTAAGGG CTTGAATTTG 
AACATAGAAA TTTATACAGG TCTGATCAGT AGTTCTTGAC ATTGTATTAT 
CTGGAAACAA ATCTTTAGAA CTGAGCTTAA GATGTTTAAT GACATTTTGT 
AGACAGAGTA TGATTTCAGT GTAGTTGTTT TTGTTTCTTT CTAGATCTAG 
TTCAGAGATG AAGTATATCA ACTTTTTTTT CCTTTTTGAC CCAATGCTAG 
CAGAAAAACA ACACCTTTTA ATCATATTTA GTATTTGAAA ATGTGTATAC 
AGGTTCCTTT TTATTTTATT TATTTCTTAC AGGTTCCTTT TTAATCAGCT 
TTATTGAGAT AGAGTTCATA TACTGTATGG TTCATACCAC ATATGGTTCA 
TATACCATAC AGTATATGAA CTCACTTTAA GAGTATAATT CAGTGGGTTT 
AAGGGTATAA TTCATTCATT TTAAGGGTAT AATTCAGTGG CTTTTAGTAT 
ATTTTCTTTT TTTTCTTCTT TTTTTCTTTT TTCGAGACAG GGTCTTGCTC
TGTTACCCAG GCTGGAGTGC AGTGACGCAG CCTCAGCTCA CTGCAACCTC 
CACCTCCCAG GTTCAAGCGA ATTCTCTTGC CTCAGCATCC TGAGTAGCTG 
GAACTACAGG CTCACGCCAC CACACCCAGC TAATTTTTAT ATTTTCAGTA 
GAAACAGGGT TTCACCATGT TGCCCAGGCT GGTCTCGAAC TCCTGGCCTC 
AAGTGATCTG CCTGCCTCAG CCTCCCAAAG TGCTGGGATT ACAGGCATGA 
GCCACCGTGC CTGCCTGTTT TGTAGTGTAT TCAAACAGAG TTGTACAACT 
GTCACCACAA TCAGTTTTAG AACCCCAAAA AGAAACCCTG TACTCTTTAC 
CAGTCACTCC CTATCTTCCG TCCACTAACC CCTGGCATCC ACTAATTTAC 
ATGACCTCTA TGAATTTGCC TATTCTGAAC ATTTTATAAA TGGAATTCTA 
AATACACTAC CTTTTATATC TGGATGCTTT TACTAAGCAT GTGTATTTTT 
GAAATTGACT TTAAAGCTTG TTGGCCCCTG GAAGAGTAAA TTACTCTCCA 
CCCCAAGTAT TCCCTCTACC CCTCAGCTTT GCCTGTAAGT TTCTTTTTAA 
AAAAATCACA CATACATTGT TGTAGTAGAT TTAAGAATAA GTATTTTGCT 
GACCCAAGGT TCTTTTGCTT CTTTCTAGAT CAGTGCCTTG CAGGTTTTAT 
TTTACAGAGC TTAATAGAAT CAGAAATCTC TTTAAAACTC CAGTCTCATA 
TCCAGTTATC ACTCACCATC TCTGTGTTTG CAGCAATAGC CAGGCCTGGC 
CCAGAGGGAC TTGATCTCCA CTTTTGGTTT TTAGACTTTT CTGTGGCTTT 
TACCACCTGC TGTGTATCCT TGACCTCATA CTGCTGGACT CCTTTGGATG 
GATACCAGCA GGATGGTTCA GGCTCCAGTG GGCACTTTTT AAAATTCTCT 
CCTTCTGTTC AGATAGACAG AGCTCAGGCA GATCACCAAG TCTGTTGCCT 
GTGTAACCAG GAAAGAGATG CTAGTTTTCT TTTAGGCACT CCCATTTGTT 
TCTGTTGGAA CCTTCCTCAC TTAGTTGATG GAAGGGAAGC AAAAGACCCA 
GAACTCCATC TCAAATTAAT GACTTAACAA TTCTTGAATT TTCTCTTATC 
TCCTAGTTAA CTCTTTTTCT TATCTCCAGG AGGTCAGTTT TAATTATTGT 
TGTTTATTCA TTTATTTTCA TGGAGACAGG GTCTTACTGT GTTGTCCAGG 
CTGGCCTTGA ACCCCTGGCC TCAAGGAATC TTTTCACCTC AGCCTCCCAA 
AGTGCTGGAA TTACAGGCAT GAGTCACCAC ACCCAGCCTG ATATTTTTCA 
GTTGATGTAT CATAGTTGTG CCTAAGCATA ATTTTTTAAT TTTAATTTTT 
TATTTTTGGG GACAGGGTCT CCCTCTGTCG CCCAGGCTGG AGTGCAGTGA 
TGCGATCTCA GCTCACTGCA ACCTCCACCT TCTGGGTTAA AGCGATTCTC 
CTGCCTCAGC CTCCCGAGTA GCTGGGACTA CAGGTACCCA CCATCACACC 
CGGCTAAATT TTTTGTGTGT ATTTTAGTAG AGACGAGATT TCGCTGTGTT 
GCCCAGGGCG GTCTCGAACT CCTGAGCTCA GGCAATCCGC CTGCCTCGGC 
CTCCCAAAGT GCTGGGATTA CAGGCATGAA CCACCACGCC CGGCCAAGCG 
TAATATTTTT AAGGGTCATC AATGTTGTGT CATGAATCAA TCAGTGTTTC 
GTTCTTTTTT ATGGTTGAAT AATATTCCAT GGTATGGATT TGTCACATTT 
TGTTTATCCA TTCATTAGTT GATAGACATT TTGGATTTCC ACTTTTTTTT 
TTTTTTTTTT GCTATTATAA ATAGTGATAC TATGTACAAA TTTTTGTGTG 
GAAATATGTC CTCATATCTC TTGGTTATAT ACCAAAGAGT GGAAGTGCTG 
GGTCATATGG TAACTACGTG TTTAACATTT TGAGAAACTG CTAAACTGTT 
TTCCAAAGTT GCTGTACCGT CGTACATTCC TGCCAGCAAT ATATGAGGAT 
GCCAGTTCCT TCACATGTTC ACTACACTTA TCCACCTTTT TTATAATAAC 
TAATGGTGGG TGTGAGATGG TATCTCATTG TAGTTTTGAT TTGTATTTCT 
CTGATGGCTA AATGGCTAAT GATGTTTGAA CTTTTTGTTT GAGACAGAAT 
CTCACTCTGT CCAGATTCAA GCGATTCTCC TGCCTCAGCC TCCCTAGCAG 
CTGGGATTAC AGGCACATGC CACCACACCC AGCTAATTTT TTGTATTTCT 
AGTAGAGACA GGGCATTACC ATGTTGTTCA GGCTGGTGTC GAACTCCTGA 
CCTCAAAGGA TCCGCCTCCC TGGGCCTCCC AAAGTGCTGG ATTACAGGCT 
AGAGCCACCA TGCCAGGCCT TATGTTTGAA CATCTTTTAT GTGCTTATTG 
GACATTTGTG TATCTTCTTT GGAGAAATGT CTGTTCAAAG TCTTTGTCCA 
TTTTTAATTG GATTGTCTTT TTGTCTTTTG ATGTGTAAGA GTTCTTTATG 
TGTTTTGGAT ACAAGTTTGT TAGATATATG ATTTGCAAAT CTTTTCTCCA 
ATTTTTGTGG ACTTTTGCTT TCTTTTTTTG TTTTGTTTTT GTTGTTGTTG 
TTGTTGTTGT TGTTTTGGTC GGGGGACAGT CTTGCTCTGA CCACCCAGGC 
TGGAATGGAG TGGCGCGATC TTGGTTCACT GCAACCTCTG CCTCCTGAGT 
TCAAGCTATC CTGCTTCAGC CTCCCGAGTA GCTGGGACCC AGGTGTGTGC 
CACCACTCCC AGCTAATTTT TTATTTTTAG TAGAGACCGG GTTTCACCAT 
GTTGGCCAGG CTGGTCTTGA ACTACTGACC TCAGGTGATC TGCCTGCCTC 
AGCCTCCCAA AGTGCTGGGA TTACAGTCAT GAGCCACTAC ACCCTGATTC 
TTTTTGCTGG CTTTCTTTCT TTTTTTTTCT TTTTTTTTTT GAGACGGAGT 
CTCGCTCTGT TGCCAGGCTG GAGTGCAGTG GCATGATCTC GGCTCACTGC 
AACCTCTGCC TCCCGGGTTC AAGCCATTCT CCTGCCTCAG CCTCCCGAGT 
AGCTAGGACT ATAGGCACAT GCCACCATGC GCAGCTAATT TTTGTATTTT
TAGTAGAGAC GGGGTTTCAC CATGTTGGCC AGGATAGTCT CGATCTCTTG 
ACCTCGTGAT CCGTCCGCCT GGGCCTCCCA AAGTGCTAGG ATTACAGACG 
TGAGCCACCA CACTCAGCCT CTTTTTGCTT TCTTGATGGT GTCTTTTGAA 
ACAAAAGTTT TTACTTTTGA TAAAGTCCAA TTTGTCTATT TTGTTTGTTT 
GTTTTTGTTA AGAAGCTTTG CCTAACCCAA AGTCACGAGA ATTTTCTCTT 
AGGTTTTCTT CTAAGAGTTT TATAGTTTTA GCTGTTTCTG TGATCCATTT 
TGAGTGAATT TTTGTGAATG GTATGAGGGA GTGATCCAAC TTCATTCTTT 
TGTGTGTGGA TATCAAGTTG TCCCAGCACT ATTTGTTTAA ACCACTGTTC 
TTTTCCCCCA TTGAATTATC TTGGCATCAT TGTCAGAGAT AAATTGACCG 
TAAATGTGAG GGTTTTATTT CTGAACTCTC AAGTCCATTT CATTGGTCTA 
CATGTCCCTA TGCCAGTAAT ACACTATCTT GGTTACTGTA GCTTTTTAGT 
ACGTTTTGAA ATGTTTTTAA AATTTGTTTT TCATCTAAAT TTTAGGATTA 
ATTTGTCAAT TTCTGCACAA AAGGCACCTG GGTTTCTATA GGGGTTATGC 
AGAATCTGTA GATCAACTGG GGGAGTATTA CAGGCATGAG CCACCGTGCC 
TGGCTGACTG AGTTTTTCAT AGATGTACTC TATCAGGTTT AGGAAGTTCC 
CTTTTATTCC TAGGTTGTTG AGTCTATTTT ATATTACTTT TTTAGAGACA 
GTCTTGCTCT GTCCCTCAGG CTGTAGCACA GTGGCTCAAT CATAGCTCAC 
TGCAGCCTTG AACTCCTAGG TTCAAGAGAT CCGCCTGCCT CAGCCTTCTT 
AGTAGCTGGG ATTACATGCA TGCACCACCA TACTGGGCTA ATTTTTTAAA 
ATTTTTTATA GAGACAGGGT CTTATTACTA TGTTGCCCAG ACTGGCATTG 
AGTCTTTTTA TCATTAATGA GCACTGAATT TTGTCAAGTG CCTTTATAAT 
ACCTATTGTG ATGATCATAG GGTTTTGTTC TTTAGTCTAC CGATACGCTA 
TATTGCATTA AGTGATTTTT TTGAATGTTA AACCAACCTT GCATTTTTTT 
GGTGTATAAG TCTTATTTGA TCAATGTGTA TTATCCTTTT ATATGGTGCT 
GGATTTAGTT TGCTACTATT TTGTTGAGGA TTTTTGTGTC TATATTCATA 
AGAGATATTG GTCTGTAGTT TCTTGTGATG TCTTTGTCTG GTTTTAGAAT 
CAGGGTAATG CTGGCCTCAT AGAATGAATT GGGAAGTGTT GTCTTTTCTA 
TGTGATGGGA GAGTTTGTGA ATCATTGGTA TTAATTTTTC TGTAAATGTT 
TGGTAGAATT CACAAATAAA GGCATCTGAG CCTGGGCTCT TCTTTGTGGG 
AAGTTTTTGG CTTTTTTTTT CTTTAAAAAT TTTCATTGTG GCTGGGCATG 
GTGGCTCACG CCTGTAATCC CAGCACTTTG GGGGGCCAAG ACGGGTGAAT 
CACCTGAGGN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNGAG 
TGCAATGATT GCAGTCTTGG CTCACTGCAA CCTCTGCCAC CTGGGCTCAA 
GAGATTCTCC TGCCTCAGCC TCCTGAGTAG CTGGGATTAC AGGCATGCGC 
CACCATGCCC AGTTAATTTT TGTATTTTTA GTAGAGACGG GGATTCTCCA 
TGTTGACCAG GCTGGTCTCT AACTCCTCAC CTCAAGTGAT CCGCCCGCCT 
GGGCCTCCCA AAGTGCTGGG ATTACAGGCA TGAGCCACCA CGCCCGGCCT 
TAAAAATTTT TTTAATGTAC AGTTGAGTAG TATTTAATAC ATTCACATTG 
TTGTGTACCC AGTTTCCAGA ACTCTTCATC CTACAGAACT GAAACTCCAT 
ACCCATTAAA TGAGTCCCCA TTCTCTTTCC CCCAGCTCAT GGCAAACAGC 
ATTCTATTTT CAGTCTCTAT GAATTTGATT AGTTTAGATA CTTCATACTG 
TAAGTGGAAT CATATGGTAT TTGTCTTTTA GTGACTGCCT AATTTAAAAA 
AAATTTTTTT GAGACGGAGT CCTGCTCTGT CGCCCAGGCT GGAGTGCAGT 
GGCACCATCT CTGCTCACTG CAACCTCCAC CTCCCAGGTT CAAGTGATTG 
TCCTGCCTCG GCCTCCCACG TAGCTGGGAT TACAGGTGCT CGCCACAACA 
CCCGGCTAAT TTTTGTATTT TTAGGTAGAG ACTGGGTTTC ACCATGTTGG 
CCAGGCTGGT CTCGAACTCC TGACCTCAAA TTATCCACCT GCCTTGGCCT
CCCAAAGTGC TGGGATTACA GGCGTGAGCC ACTGTGCCCA GCCTCCATGT 
TGTTTTTCAC AACACCTGTA TCATTTACAT TTCCACCAAC AGTACACAAG 
AATTTCAGTT TCTCCACATC CTTGCTAGCA GTTGTTATTA TCTGTTTTTT 
TTTAATGGTT TCTTTTTTCC TTTTTCTTTT tTTTTTTTTT TGAGACGGTC 
TTATTCTTGC TCATGCTGGA GTGCAGTGGT GCAATGTGAT AGCTCACTGC 
AGCCTCAACC TCTGGGCTCA AGCAGTCCTG CCACCTCAGC CTCCACATAG 
GTGGGACTGC AGGTGTGCAC CACCACTTGT GGCTAATTTA aaaaattttt 
TCGTAGAGAC AGAGTCTCAC AGTGTTACCC AGGCTGGTCT TGAACTTCTG 

agcacaagtg atcctcccac ctcagcctcc caaaataatg agattagaga 
catgagccaa catgcccaac cagttttgtt tgtttgtttt gttttgtgtt 
15751 TTTGAGACAG AGTCTCACTC TATTGCCCAG GCTGGAGTGC AACGGCATGA

15801 TCTTAGCTCA CTGCAACCTC CGCCTCCCAG GTTCAAGTGA TTCTCATGCC 

15851 CCAGCCTCCT GAGAAGCTGG GATTACAGGT GTACCACCAC ACCCAGTTAT

15901 TTTTGTATTT TTAGTAGACA TGGGGTTTTG CCATGTTGGC CAGGCTGGTC 

15951 CCGAACTCCT GACCTCAAGT GATCTGCTCC CCTCAGCCTC CCAAGGTGCT 

16001 AGAATTAAGT TTTTCTTTCT TTCTTTCTTT CTTTTTTTTT TTTTTTTTGA 

16051 GACAGAGTCT CACTCTGTCA CCCAGGCAGG AGTGCAATGG CACGGTCTTG 

16101 GCTCATTGTA ACCTCTGCCT CCCAGATTCA AGTAGTGATT CTCCTGTCTC 

16151 AGCCTCCCAA GTAGCTGGGA TTACAGGCAT GCACCACCAC GCCCAGCTAA

16201 TTTTTTGTAT TTTTAGTAGA AACGGGGTTT CACCATGTTG GTCAGGCTGA 

16251 TCTCAAACTC CTGACCCCAA GTGATCCACC CGCCTTGGCC TCCCAAAGTG 

16301 TTGGGATTAC AGGCGTGAGC CACTGTGCCT GGTTTTATTT TTATTATTAT 

16351 TATTTTTAAT AGTTCCTATT CTAATGGGTA TGAGGTAGTG AGGTGGGTGG 

16401 TTGTGGTGTT TTTATGAATG TTTAATTGGA AATGGGTGGC CATTGTGTGC 

16451 AGGAAAAACC TCCTAAATTG TGTCAAACTC CTGGAAAATG AAATATCATT

16501 CCAGTTGCAA GAATATCTTT tttTTTTTTT TTTTTTTTTA AGACAGAGTC 

16551 TCACTCTGTC ACCAGGCGGG AGTGCAGTGG CACGATCTCG GCTCACTGCA 

16601 ACCTCCGCCT CCTGGTCCGC CTCCCGGGTT CAAGTGATTC TTCTGCCTTA 

16651 GCCTCCCAAG TAGCTGGGAC TACAGGCGCG TGCCACCACT CCTGGCTAAT 

16701 TTTTTTGTAT TTTTAGTAGA GATGGGGTTT CACCATGTTG GCCAGGATGG 

16751 TCTCAATCTC TTGACCTAGA GATCCGCCTG CCTCGGCCTC CCAAAGTGGT

16801 GGGATTACAG GGGGGTCACC GTGCCCAGCC ACAAGAAGAT CTTGAGCATG 

16851 TGAATGATCA GAAATGATTT AGCCTATGTA GGCACTAGGC CAGGTAGTGA 

16901 AATTCAGGGA AAATAATTCA GATGCTTCTG AGCTATCACT TATGAACTAA 

16951 GAAACAGCTT AAAGCCATTA TAGTGTGTTT CCTGAAGATG AAAGCATATG 

17001 GTAAGATGAA ATAGTGATTA TTTTTTAAAA ATTACTACTC CAGAAAGGAA 

17051 AAGTTTACTA ATTTTTATTA CTAAAGTTTA CTGTTGGTGG GTGCGGTGGC 

17101 TCACACCTGT AATCCCAGCA CTTTGGGAGG CCGAGGCAGG CGGATCACCT 

17151 GAGGTCAGGA GTTCGAGACC AGCCTGACCA ATATGGTGAA ACCCCATCTC 

17201 TGCTAAAAAT AAAAAATTAG GCCGGGCGCG GTGGCTCATG CCTGTAATCC 

17251 CAGCACTTTG GGAGGCCGAG GCAGGTGGAT CACGAGGTCA GGAGATTGAG 

17301 ACCATCCTGG CTAACACGGT GAAACCCCGT CTCTGCTAAA AATACAAAAA 

17351 TCAGCCGGGC GTCTTGGCAG GCACCTGTAG TCCCAGGTAC TCAGGAGTTT 

17401 TGAGACGGGA GAATGGCGTG AACCCGGAAG GCGGAGCTTG AAGTGAGCCG

17451 AGATTGCGCC GCTTCAGTCC AGCCTGGACG ACAGAGTGAG ACTCTGTCTC

17501 TAAAAAAATA AATAAATAAA AATAAAAAAT TAGCTGGGTG TGGTGGCACG 

17551 CACCTGTACT CCCAGCTACT CGGGAGGCTG AGGCAGGAGA ATTGCTTGAA 

17601 CCCGGGAGAT GGAGGTTGCG GGGAGCCAAG ATTGCGCCAC TGCACTCCAG

17651 CCTGGCGACA GAGTGAGACT CTTTCTCAGA AAAAAATATG ATAATTAAAA

17701 GTTGAGACGT TCTTCGCCGA GAGTGGTCGG GGTTTCCTGC TTCAACAGTG 

17751 CTTGGACGGA ACCCGGCGCT CGTCCTGCAC CCCGGCCGGC CGCCCATAGC 

17801 CAGCCCTCCG TCACCTCTTC ACCATGCCCT CGGACTGCCC CAAGGCCCCC 

17851 GCCGCAGCTC CAGCGCCGCG TAGCCACCAC TGCCGCTGCC GCCGCCTCTC 

17901 CTTAGTCGCC GGCATGACGA CCGCGTCTAC CTCGCAGGTG CGCCAGAACT 

17951 ACCACCAAGA CTCAGAGGCC GCCATCAACC GCCAGATCAA CCTGGAGCTC 

18001 TACGCCTCCT CCATTTACCT GTGCGTGGCT TACTACTTTG ACAGCGATGA 

18051 TGTGGCTTTG AAGAACTTTG CCAAATACTT TCTTCACCAA TCTCATGAGG 

18101 AGAGGGAACA TGCTGAGAAA TTGATGAAGC TGTAGAACCA ACGAGGTGGC 

18151 CGAATCTTCC TTCAGGATAT CAAGAAACCA GACTGTGCGG GGAGAATGCG 

18201 ATGGGAGAGC GGGCTGAATG CGATGGATTA CATTTGGAAA AAATTGTGCA 

18251 TTTTGCATTA CATTTGGAAA AAAATGTGAA TCAGTCACTA CTGGAACTGC 

18301 ACAAACTGGC CACTGACAAA AATGACCCCC ATTTGTGTGA CTTCATTGAG 

18351 ACACATTACC TGAATGAACA AGTGAAGGCC ATCAAAGAAT TGGGTGACCA 

18401 CGTGACCAAC ATGCACGAGA TGGGAGCGCC CGAATCTGGC GTGGCAGAAT 

18451 ACCTCTTTGA CAAGCACACC CTGGGAGACA GTGATAATGA AAGCTAAGCC 

18501 TCAGGCTAAT TTCCCCATAG CCATAGGGTG ACTTACCTTG TCACCAAGGC 

18551 AGCGCATGTA TGTTGGGGTT TCCTTTACCT TTTCTATAAG TTGTTCCAAG 

18601 ACACCCACTT AAGTTCTTTG ATTTGTACCA TTCCTTCAAA TAAATAAATT 

18651 TGGTACCCTC CCCCCCCCCA AAAAAAAAAT GTACTGTGGG CTGGCGTAGT 

18701 GGCTCATGCC TAAATCTCAG CACTTTGGGA GGCTGAGGCG GGAGGATCAC 

18751 CTGAGGTCGG GAGTTTGAGA CCAGCCTGGG CAACATGGTG AAACCCCGTG 

18801 TCTACTAAAA ATATAAAAAC TAGCCAGTCA TGGTGGCACA CACCTGTAAT 

18851 CCCGGCTACT TGGGAGGCTG AGGCATGAGA ATCACTTGAA CCTGGGCTGC 
18901 GGAGGTTGTA GTAAGCTGAG ATCATGCCAC TGTACTCCAG CCTGGGTGAC 

18951 AGGGAGACAT TCTCTCTCTC AAAAAAAAAA AAAAAACAAA AAAAACAAAA 

19001 CAAACCAACA AAACAAAGTA ATCCAGGAAC AACAACATGA TGAAGGACTG 

19051 CATGCAGGAC TCAGTGATGG ATGGTGGAAG ACAGCCAGGA AGTTAAGCAT 

19101 GACTCTGGTA TTAAGTGTTG TCTGGGAGAG TTAAGATTCC ATTTACAGAA 

19151 ATAAGACCTG TAGGGGAAGC TCTTGATTTT TTTTTTTTTG CAGACTGCTG 

19201 ATTTCCTGAT TACATGTGTT AAGTTTGAGG TATAGAGAGA AAGAACATCC 

19251 TGGCCGGGTG CAGTGGCTCA CACCCGTAAT CCCAGCACTT TGGGAGGCCA 

19301 AGGTGGGCAG ATCACGAGGT CCAGGAGATC GAGACCATCC TGGCCAACAT 

19351 GGTGAAACCC CGTCTCTACT AAAAATACAA AAATTAGCTG GGCGTGGCGG 

19401 CGCGTGCCTG TTATCCCAGC TACTCAGGAG GCTGAGGCAG GAGAATTGCT 

19451 TGAACCCGAG AGGCAGAGGT TGTGATGAGC CGAGATCGCG CCACTGCACT 

19501 CCACCCTGGC AACAGAGCTA GACTCTGTCT CAAAAAAAAA AAAAAGAAAG 

19551 AAAAAAAGAA CATCCTGTAG AAACAGGCAG TCAGAGGTAT AGAACTACAC 

19601 AGAATCCAAG AGATCTTTCA AGAAAAGTGA CATGCAGCAA GAGAAACTAT 

19651 CAAGGGGGTA AACAACCTAT AGAATGGGAG AAAATATTCA CAAAGTATAC 

19701 ATCCAACAAA AGTCTAATAT CCAAAATCTA TAAGGAACCT AACAAGCAAA 

19751 AAGCAAATAA CCCCCTTAAA AAGTGGGCAA AGGACATGAA CAGATACTTC 

19801 TCAAAAGACG TACATGTGGC CCACAAACAT GAAAAAACGC CCATTTCTAA 

19851 TCATCAGACA AATGCAAATT AGAACCACAA AGAGATACCA TCTCACACCA 

19901 GTCAGAACAG CTTTTGTTAA AATGTCAAAA AATGAGAAAC GTTGGTGAGG 

19951 CTGCAGAGGA AAGCAGACAC TTGTACACTG TTGGTGAAGG TGTAAATTAA 

20001 TTTAGCGTAG GCACAGTCAG TTTGGAGATT TCTCAGAGAA CTAAGAGTGG 

20051 AACTACCATT AGACCCAGCA ATCTGATGGC TGGATATACG GCCAAAGGAA 

20101 AATAAATCAT TCTGCCAAAA GAACATATGT ACCTGTATGT TCATTGTGGC 

20151 ACCATTCACA ATAGCAAAAA CATTGAATCA ACTCATGTGC CCATCAGTGG 

20201 CGGACTAGAA AAGAAAAGAA AATATGGTAC ATAGCCATCA TGGAATACTA 

20251 TGCACCCATT AAAAATAATG AAATAATGTC TTTGCAACAA CATGAATGTA 

20301 GCTGGAGGGC ATTATCCTAA GCAAACTAAC ACAGAAACAG AAAACCAAAT 

20351 ACTGCGTGTT CTCACGCAGT GAGAGTGGGA GCTGAACATC AAGTACACAT 

20401 GGATGTAAAG ATGGCAACAA TAGACATGGG TCTACTAGAG GTGGTGGTGC 

20451 GGCAGGGTGG GGGTGGGGGT TGTGTGGCAG AGGAACAGCT GAAAAACTAC 

20501 CTATTTGATA CTATACCCAG CACCTGGGAA ACGGGTTCAG TCATACCCCA 

20551 AACCTCAGCA TCACACAGTA TACCTTTCTA ACAAACTTAC ACATGTATTC 

20601 TGTGATTCTA AAATAAACAT TGAAAAATAA AAAAAAAACT GACATGGTTT 

20651 GTACTGTTTA ATCTGACATA ATGGCTAGGG GAAATGAAGT CTGCAGAATG 

20701 GCTGTTTACG GATGTTGTTG TTGTTGTTGA GATGAGGTCT CACTATGTTG 

20751 CCCAGGCTTG AACTCCTGGC CTCAAGCAGT CCTCCTGCCT TGACCTCCCA 

20801 AAATGTTGAG ATTACATGCA TGAGCCATTG CCAAAACGGC TATTTGGATT 

20851 GCTGTTAAGG TTATTACATT CTCTGTGTAG TAAGACCTTG AAGGAGAAGG 

20901 ATTTGAGATC AGGAGTTTAA GAAAAAATGT TAATCTAGGA AGAGAGGATA 

20951 ATTTCTGTGT TTGGCCAGTT GCAATGGCTC ACGCCTGTAA TCCCAGCGTT 

21001 TTGGGAGGCC GAGCTGGGCA GATCACTTGA GCTCAGGAGT TTGAGAAGAG 

21051 CCTGGGCAAC ATGGTGAAGC CCCGTCTCTA CTGAAAATAC AAAAATTGGC 

21101 TGGGCGTGGT GGCAGGTGCC CACTGTAATT TCAGCCACTC AGGAGACTGA 

21151 GGCAGGAGAA TTGCTTGAAC CTGGAAAGTG GAGGATATGG TGAGCCTAGA 

21201 TCGCGCCACT CTACTCCAGC CTGGGAGACT CCATCTCAAA AAAAAAAGAA 

21251 AACAAGATGC TGAAATGAAG TAATTACCAC AGTCAATGTG ATCCTATAAC 

21301 TTTGTTTTCT TTTAGAGATG GGGTCTCCCT CTGTCACCCA GGCTGGAGTG 

21351 CAGTGGTGCA TCATAGCTTT CTGCAGCCTC CACCTCCTGG GCTCAGGTGG 

21401 TCCTCTTGCC TCAGTGTTCC GAGTAGTTAG GACTGACTGC AGGTGCATGC 

21451 TGCTATGCCT GGCTAACTTT AAAATTTTTT TGTAGAGGCG GGGTCTTGCT 

21501 ATGTTGCCTA GGCTGGTCTC CAACTCCTGA TCTCAATCAG TCCTCCTGCC 

21551 TACCTTCCCA GAGCGCTGGG ATTACAGGTG TGAGCCATCG CACCTAGCCA 

21601 ATCTCATAAC ATTTTATGAC TAGCAAACTT AGTAGTTCTG ATTCAGGCAT 

21651 AAATCAGTTG GTGGGGTTAT ACAAGGTTGG GTGAGTTTTT CTAGATTTCT 

21701 AAGAGACCAT GTTGAAATAC TTGGCTCTGT CTCAGTAAGG GATAGAGAGA 

21751 AGCAAAGGTG TGGGTAAAGG TTATGAGCAG ACATGTAAAG GGGGCAAATT 

21801 AAAGTGTTTA GGGAAGGTGA AACAGTTCCA AATCATAATA TAGATCCCAA 

21851 GCCTCACCAA GAAGTGAAGG AGAGGGTAAT TGTGCGATAG TTCTCAGGAC 

21901 TGAGACCTCA AGGTGTAAGA CGAATCTTTA TTGTGGGTGG TCCTCTTTGA 

21951 GAAAAAGAAC AAAAAAGAAA AATGTGAAAT GGGTGCTAAA TTTCAGAGCA 

22001 GAATATACAT ATATGTGTAT GTGTGTGTGT ATATATATAT ATATTTTTAT 



FIGURE 3, page 7 of 17 



22051 GTAGCAAATT CCAGTTTACA AAGGGCTTTT ATGTGTTTTG CATCATTACA 
22101 ACAGTTCTGT GATGATGTGG ATGTGGTGGA TATTGTAATT CCACATCCCA 
22151 GAGGATAAAA TTGAGGCACA CCATAAGGTA GCTGACCAGA GATCATGCAG 
22201 TATATGGTAG AATGGCAACT TGAGGCCAGT TCTTAGGTGT GTTTGTTGTC 
22251 CTTATTCAGA ATGGAACAAT GTGGTTTATT GTACAAAAAT TTAAAAATGA 
22301 ATGTCGAAAA GTAGAAATTA TTACCCAAAT CTCACCCTCT GTGGTTGTTT 
22351 GCTATGTGAT CTTCCAGACT CACATATACA TGTAGATAAT TTTTTTTTTT 
22401 TTTTTTGAGA CGGAGTCTTG CTCTGTCGCC CTGGCNNNNN NNNNNNNNNN 
22451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
22501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
22551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
22601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
22651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
22701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
22751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
22801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
22851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
22901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
22951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
23001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
23051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
23101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
23151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
23201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
23251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
23301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
23351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
23401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
23451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
23501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
23551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
23601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
23651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
23701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
23751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
23801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
23851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
23901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
23951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
25001 NNNCAAAAAT TAGCTGGGCG TAGTGGTGGG CGCCTGTGGT CCCAGCTACT 
25051 TGGGCAGCTG AGGCAGGAGA ATCGCTTGAA CCTGGGAGGT GGAGGTTGCA 
25101 GTGAGCCAAG ACCATGCCAC TGCACTCCAG CCTGGGCAAC AGAGCAAGAC 
25151 TCTTGTCTTA AAAAACAAAA AAGTGTACCC AGTTGAGCTG ATTCTTTATC 
25201 TTTTTTTCAC TGGAGAACTA AGTATACAGG TGAGAAAAGA CGAGATATTT 
25251 ATACCCGAGA GAATTGATGG TGAAATCCAT TTTTTTGGAT CAGAACTTCC 
25301 CCAAACAGTG TCCTTCAAAT AGGGTTCAGG GGTGCTAAGA TATTTATCCC 
25351 CTCAACCCTT GGGGTTCACT CCAGTATGGC ATATAAATAT TGTATCACTT 
25401 TCTATGTGTG GGGAGCAGTG CTCCAGGTGA CCTTCCTTCC TTTCCTTCTA 
25451 GGGGAGGAAG TTGCCTTGTA CTGTGCCAAA TATCTTCCTG ATATCATCAA 
25501 AGATCAGAAG GCCTACAAGG AAGGCAAGCT ACAGAAGGTC TGTCTGCTTA 
25551 CACCGCCCAT TCCTCACTTG TGTAGGCTTT TCCCTTGTTC TCTAGCCCTT 
25601 GGGCTTTTCC TTTCTTTTTG TCCTCTAGCT GCTGCTGCTT ATTTACTCTT 
25651 GAAGAATTCT GTTCCTAAAA CGAGCTTATT GGCCGCCTTT TAGACTTGCC 
25701 TTATTATTCC TAGGCCTCTG AGCTGTTTTT ATCTGTGAGT GTCTCTTAGT 
25751 GTGGTGGCTC ACACTCTTAA TTTGTATTCC ATCCTTGTGC TCAGGATTGT 
25801 ATATAGGGAG TTCATTTTGT ACTAGTCTTA GACTATTTTG CTTATATTCA 
25851 GGCTTTAGAA GATGCCTTCT TGGCTATTGA CGCCAAATTG ACCACTGAAG 
25901 AAGTCATTAA AGAGCTGGCA CAGATTGCAG GGCGACCCAC TGAGGATGAA 
25951 GATGAAAAAG AAAAAGTAGC TGATGAAGAT GATGGTGAGT GTGGCATCCC 
26001 TTGTTTGAGG GGAAATCAGC ATTTTAAGAA ATATTCTTTA ATATTACTTA 
26051 TCAATTCTAA GATAGGATGG CTTTCTAGGG ACCTGGGGAG TCCTTATGTT 
26101 AAAGAAACCT ATGATGTTCT CCTGCATTGT ATGTGGTTAT GAAAAGGAGG 
26151 GAGAGAATTA TCTTCGTTGA GTGGCATCTG AGCTGTAAGC ATTGTATATA 
26201 CATTATCTTT TGTCATTGTG ATCGGGTCTT CCTGGTTCCT GCTAGTATTT 
26251 ATGTGCTTTT TTTTCCCCTC AAGACTGGAG CAGTTATTAG CCCCAATAGC 
26301 CAATCATTAA GCCTAAATCC TAATTCACAG TAGCATTGTG GGCTTCCTGG 
26351 ATCCTCAGCC AGAATAGGGT TTTTACAACT TAACAATAAA AAATGAGACG 
26401 TCAGAGGGGA AGTATAGTAA CTAGTGTTGT TTTGATTAAG AAGGGGATGA 
26451 AACACAAAAA CCAAAAGAAG TCTGTGGAGG AGGAGGAGCT AGGGCATGTT 
26501 CTTCTGAGAC TTGAGCGAGA GGAACCTTGG GAGTGGGAGG TTGTGGGGAA 
26551 GTTAGAGGCT GCAAGGGCTG TTGAGGTAGT GAGAGGGACG GATCCCATGA 
26601 GGAGTCTGGC ATGGGGGCTC TGATTTAGCC TCTTCCCTGC AGTGGACAAT 
26651 GAGGAGGCTG CACTGCTGCA TGAAGAGGCT ACCATGACTA TTGAAGAGCT 
26701 GCTGACACGC TACGGGCAGA ACTGTCACAA GGGCCCTCCC CACAGCAAAT 
26751 CTGGAGGTGG GACAGGCGAG GAACCAGGGT CCCAGGGCCT CAATGGGGAG 
26801 GCAGGACCTG AGGACTCAAC TAGGGAAACT CCTTCACAAG AAAATGGCCC 
26851 CACAGCCAAG GCCTACACAG GCTTTTCCTC CAACTCGGAA CGTGGGACTG 
26901 AGGCAGGCCA AGTTGGTGAG CCTGGCATTC CCACTGGTGA GGCTGGGCCT 
26951 TCCTGCTCTT CAGCCTCTGA CAAGCTGCCT CGAGTTGCTA AGTCCAAGTT 
27001 CTTTGAGGAC AGTGAGGATG AGTCAGATGA GGCGGAGGAA GAAGAGGAAG 
27051 ACAGTGAGGT AAGGGCCTGT GAGGGCAGGC AGATGCTGAA GTTGCAGAGA 
27101 GGTCCTGTTT GGTTGCCGTC TGTAGTTTTC AACTCTCTTT CCTTCTCCTA 
27151 TTTTGACATC ATCCCCCAAG ACCCACTGTA TTCTAAGCTT TAGTCTTGAA 
27201 TTCATTGAGC TCCATCATCA CAGGTACCAT TTGCCTTTTT ACCTCTTCCT 
27251 TTGTTGGTAC TATAACAAGC AGATCTAGTT CTGGCTTTTC AGAGTCTGTC 
27301 TCCTAGAGAG AGAACAAGGA GATAGTTGTT ACCTTGGCTA GTTGACTGTT 
27351 TTCTTCTCTG GAAAATTTAT TTTCTGGCCA CAGTGCCTGA AAGATATTTT 
27401 TGGCTGGCAG CCCTTGCCTT GTCCTGGGCT TTTTGCTAGT GACTGCTAAG 
27451 CCCAGTTCAG GATGTCAGTT GTACTCATGC TAGCCCTTTC CATCCCCCCA 
27501 ATTTTCATGA CCATATACTT GTATCTTTCA GTGTTTTGAG GACCTGTGTT 
27551 CAGTCAGGAC CTCTTGATTC TGAGTATGAG CTGTGGGGAG GGAGGGGATC 
27601 ATCCCAGTCT CAGCAGTCTG GGATCCTCCC CCTGGCAGGA ATGCAGCGAG 
27651 GAAGAGGATG GCTACAGCAG TGAGGAGGCA GAGAATGAGG AAGATGAGGA 
27701 TGACACCGAG GAGGCTGAAG AGGACGATGA AGAAGAAGAA GAAGAGATGA 
27751 TGGTGCCAGG GATGGAAGGC AAAGAGGAGG TGTGTGGGGA AGGGGAGCAA 
27801 TGAGTCTTGA AAAGCCACAA GGCAGGTGTG AATCCCCTAA TTTTGATTTT 
27851 GAGACAGGGG ATCCCCCTGA TACTTTAGGA TGGAAGTAAT AGTCATGGGG 
27901 ATTTATTCTG CAAGGGGAAT GAGATGGTAA GCCTTTGGGG TTGAATTATC 
27951 TAAAAACAAG GGAGAGGGAG TGTGCTGCTG TCTCTAGAAA GATGAAATGT 
28001 GTGCTTCTCC TGTTTGTTAA AGCTCTTTTG GGGGTCCCAG TGAAAGCAAG 
28051 CATAGGTGAA CGATCAGGAG CACATCAGTG AGGAACGCAT GTTCAGAAGC 
28101 CCCCATGATG CTCCTTTTCT TCCTCTTAAG CCTGGCTCTG ACAGTGGTAC 
28151 AACAGCGGTG GTGGCCCTGA TACGAGGGAA GCAGTTGATT GTAGCCAACG 
28201 CAGGAGACTC TCGCTGTGTG GTATCTGAGG CTGGCAAAGC TTTAGACATG 
28251 TCCTATGATC ACAAACCAGA GGATGAAGTA GAACTAGCAC GCATCAAGAA 
28301 TGCTGGTGGC AAGGTCACCA TGGATGGGCG AGTCAACGGG GGCCTCAACC 
28351 TCTCCAGAGC CATTGGTAAG GGCCAAGAAA CTGGGAAAGA NNNNNNNNNN 
28401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
28451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
28501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
28551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
28601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
28651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
28701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNATGGTGAA ACCCCATCTC 
28751 TACTAAAAAA AAAAAAAAAT ACAAAAATTA GCTGGGCATG GTAGTGCAAG 
28801 CCTGTAATCC CAGCTACTCA GGAGGCTGAG GCAGGAGAAC CGCTTGAATC 
28851 CGGGAGGCGG AGGTTGTAGT GAGCCGAGAT CGTGCCATTA CACTCCAGCC 
28901 TGGGCTACAA GAGTGAAACT CCGTCTCAAA AAAAAAAAAA CAAAAAAGAC 
28951 TTAAAATAAA AAGACCAGTG AGTGACTTTC TTAAGGTTCA GCAGTCTGGT 
29001 GGCAGGGTTG AAACTAGAAA AACTAGGACT TAGGACTCAG TTCCCCATTC 
29051 CACTAGATTA TGGAACTTTG TAAAGAAGGG AAATGT^TGG CAAGGTTTGA 
29101 CCTGCCACAA ACACAAGTCT GTGGGAAGTA TCCAAACTGC TCATCAACCA 
29151 TTCCTTTACT CCAGGGGACC ACTTCTATAA GAGAAACAAG AACCTGCCAC 
29201 CTGAGGAACA GATGATTTCA GCCCTTCCTG ACATCAAGGT GCTGACTCTC 
29251 ACTGACGACC ATGAATTCAT GGTCATTGCC TGTGATGGCA TCTGGTGAGC 
29301 ACTGGCAGAA TGCCCTAAAT TCCCCTTTCT GCAGCATGTC TTCTCTTATA 
29351 GGACTCAGGG CACCTCTAGG ATTAGAGCCT AGGCAGACCT AGGCCTCTTG 
29401 GTGGGTGAAG AGCACCCAGA CTAAGGCAGA GCTGAGAATT TCTGTAGTTA 
29451 TTTACACTGG CCTGGGCCAC CACCTCTGTC CATACTCCTC TACGCTGCCT 
29501 TAGTGAGACT GGAAGATTCT GACTGTTGTT CTTGACCCCA GGAATGTGAT 
29551 GAGCAGCCAG GAAGTTGTAG ATTTCATTCA ATCAAAGATC AGCCAGCGTG 
29601 ATGAAAATGG GGAGCTTCGG TTATTGTCAT CCATTGTGGA AGAGGTGAGT 
29651 ACCAGGGTGG AGAAGAGAGG GTGTCTGGTC TGCACAGCCA GGGTT (SEQ ID NO: 3) 

FEATURES: 

Start:  2134 
Exon:   2134-2326 
Intron: 2327-25451 
Exon:   25452-25537 
Intron: 25538-25851 
Exon:   25852-25984 
Intron: 25985-29164 
Exon:   29165-29294 
Intron: 29295-29541 
Exon:   29542-29647 
Stop: 



CHROMOSOME MAP POSITION: 

Chromosome 2 

ALLELIC VARIANTS (SNPs): 

DNA 
Position    Major    Minor 
2944        A        - 
3232        G        A 
7236        T        A 
8187        T        A, G 
8187        T        -, G 
8187        -        A, G 
9172        C        A 
9173        T 
9180        T        - 
13351       T        C 
15100       G        C 
15255       A        C 
18996       A        C 
18996       A        C 
18996       A        C 
T 
AGATGTGAATAACTTTTTAAATTTTAATATTTAAAATATCTGATGTGGGAAGCCTCTTTT 
GGCTAGGAGTTTGACAGTGAAAGGAACCCCGGGCAGAGTCTGTTTCACATTTTGGTTGCC 
TGGCCTTGGGCTCTTGCATGTTAATTTCAGAGGCTGGACCCGACCTCCAGGAGTTGTCAC 
TCATTTGCACTCTTTTCAGGGCCTTTTACTAACTTCGGAAAACTTGAATTATGTCAGTCC 
CTAGGTTTTCCTTTTTATTATGCTTCGTTTTCTTTCTCTCTTAGGATTTCTCTAAAACTT 
[A,-] 

ATCAGTAATTCTCATTTGCCTGTAATTGTAGTTTATAGTTTCTTTGATTGTAAGTCATTT 
CATCAATTTTTTTCATCACAACCTACCTACAAAGGGCTTTTCTAGAAAATTTTACTCTGG 
ACAAAAGGGGAAAAGAAAAATATTGGGGGGAAAGTAGTAGTATTAGGTAAAACTTGATGT 
GAAACTACAAAAGAGAAGAGGGAAAACTGCGGTAGGGAGGAAAGGGAGGAAGACGGGTTA 
ACCGTGGCTTTGTGAAGAGCATTCTGAAGTCTAGGCAAAAGGGCCAGGGAAATACTCTGT 

TCTCTAAAACTTAATCAGTAATTCTCATTTGCCTGTAATTGTAGTTTATAGTTTCTTTGA 
TTGTAAGTCATTTCATCAATTTTTTTCATCACAACCTACCTACAAAGGGCTTTTCTAGAA 
AATTTTACTCTGGACAAAAGGGGAAAAGAAAAATATTGGGGGGAAAGTAGTAGTATTAGG 
TAAAACTTGATGTGAAACTACAAAAGAGAAGAGGGAAAACTGCGGTAGGGAGGAAAGGGA 
GGAAGACGGGTTAACCGTGGCTTTGTGAAGAGCATTCTGAAGTCTAGGCAAAAGGGCCAG 
[G,A] 

GAAATACTCTGTCTGGTATTGAGGGTTTCTCCACCTACCGGGTGGGCTTCAGGTAACAGC 
GAAATACTGTCTCCCTTGGGAATTGTTTCAGATCCCTCGCTCCTCCTGTGGTTAGCTCTG 
GAATGCCAGTATGAACTTCAATGTTTTGTTTTCCGATTCAAATTTTATATTCATAACTGA 
CCTTAATAACAATTTTACAATTAGGTATAAAATTTCAGGATCCTAGTGTATCCTATAGTT 
CATCTCATCTGCTTTGGCTCCCTTTTTTTTTTTTTGAGACAGAATTTCGCTCTTGTTGCC 

GACAGGATTTCACTATGTTGGCCAGGCTGGTCTCAAGACTCCTGACGTTGTGATCCACCT 
GCCTCGGCCTCCCAAAGTGCTGGGATTACAGATGTGAGCTACCGTGCCCGGCCAGGAATT 
TTTTGTGCTATAAATCATATTTTCCTTTTATTAAAGGCAGTGTCAATATCTATAGTATAA 
TTTTGAGGAGGCTGGCTATTTATTGCTGTGTAGAAGCTGGCTTATTAGTGGTCAAGGGGT 
CATCTAGAATTGACTATAAAGATAGTATTGAGCAGAAAATTCTTAAAATACCTGCATATT 
[T,A] 

GTTTCAGTCATTAAATTAATGGAAAAAATATAAAAAGAAATATCACAAGTATGCTATGGG 
GTTCTACCTTAGGGCATTGAAGGTTGAAAAACATTTATTTTCTTATCTTCAGATTAGCAT 
CTCATATCAATAGCCAATAGCTTAAAGCGCTTTTTACTTACTAAACCAGGTCAGAATTTC 
TCTCTCTCTCTCTTTTTTTTTTTTAGACAGAGTCTCTGTCACCCAGTCTAGAGTGCAGTG 
GTGTGATCTCGGCTCACTGGCCTTTCTGTTTAAGTGATTCTCCGGCCTCAGCCTCCTGAG 

GTTGCCCAGGCTGGAGTGCAATGGTGCCATCTCGGCTCACTGCAGCTTCTGCCTCCTGGG 
TTCTAGCAATTTGCCTACCCCAGCCTCCCAAGTAGCTGGGATTACAGGCACTCGCCACCA 
TGCCCAGCTAACTTTTTTGTATTTTTAGTAGAGAAGGGGTTTTCCCATGCTGGCGAGTCT 
GGTCTTGAACTTCTGGCCTCAAGTGATCTGCCCGCCTCAGCCTCCCAAAGTGCTGGAATT 
ACAGGCATGAGCCACTGTGCCCGGCCTTTTTATTTTTTAAATTATTTATGTATTTATTTT 
[T,A,G] 

AGACAGGATCTCACTCTTGCCCATGCTTGAGTGGTAAGGAGTATGGGATTTCTTGTGCCG 
GTTCCTTCACATCCTCACTACACTTATCTGCCTTTACAGTGGCTCGATCATAGTTCACTG 
CATAGCCTTCTGGGCTCAAGGGGTCTTCCAGCCTCAGCCTAATATAGGCACATGCCACCA 
TGACTGGCTAATTTTTTTTTTTTAAGTTTTTTTTTGTAGAGATAGGGCCTTGCAGTGTTG 
CCCAGGCTGAGGAATTTTATTTATGTTTATTTTATTTATTTATTTATTTATTTATTTATT 

GTTGCCCAGGCTGGAGTGCAATGGTGCCATCTCGGCTCACTGCAGCTTCTGCCTCCTGGG 
TTCTAGCAATTTGCCTACCCCAGCCTCCCAAGTAGCTGGGATTACAGGCACTCGCCACCA 
TGCCCAGCTAACTTTTTTGTATTTTTAGTAGAGAAGGGGTTTTCCCATGCTGGCGAGTCT 
GGTCTTGAACTTCTGGCCTCAAGTGATCTGCCCGCCTCAGCCTCCCAAAGTGCTGGAATT 
ACAGGCATGAGCCACTGTGCCCGGCCTTTTTATTTTTTAAATTATTTATGTATTTATTTT 
[T,-,G] 

AGACAGGATCTCACTCTTGCCCATGCTTGAGTGGTAAGGAGTATGGGATTTCTTGTGCCG 
GTTCCTTCACATCCTCACTACACTTATCTGCCTTTACAGTGGCTCGATCATAGTTCACTG 
CATAGCCTTCTGGGCTCAAGGGGTCTTCCAGCCTCAGCCTAATATAGGCACATGCCACCA 
TGACTGGCTAATTTTTTTTTTTTAAGTTTTTTTTTGTAGAGATAGGGCCTTGCAGTGTTG 
CCCAGGCTGAGGAATTTTATTTATGTTTATTTTATTTATTTATTTATTTATTTATTTATT 

8187 GTTGCCCAGGCTGGAGTGCAATGGTGCCATCTCGGCTCACTGCAGCTTCTGCCTCCTGGG 
TTCTAGCAATTTGCCTACCCCAGCCTCCCAAGTAGCTGGGATTACAGGCACTCGCCACCA 
TGCCCAGCTAACTTTTTTGTATTTTTAGTAGAGAAGGGGTTTTCCCATGCTGGCGAGTCT 
GGTCTTGAACTTCTGGCCTCAAGTGATCTGCCCGCCTCAGCCTCCCAAAGTGCTGGAATT 
ACAGGCATGAGCCACTGTGCCCGGCCTTTTTATTTTTTAAATTATTTATGTATTTATTTT 
[-,A,G] 

AGACAGGATCTCACTCTTGCCCATGCTTGAGTGGTAAGGAGTATGGGATTTCTTGTGCCG 
GTTCCTTCACATCCTCACTACACTTATCTGCCTTTACAGTGGCTCGATCATAGTTCACTG 
CATAGCCTTCTGGGCTCAAGGGGTCTTCCAGCCTCAGCCTAATATAGGCACATGCCACCA 
TGACTGGCTAATTTTTTTTTTTTAAGTTTTTTTTTGTAGAGATAGGGCCTTGCAGTGTTG 
CCCAGGCTGAGGAATTTTATTTATGTTTATTTTATTTATTTATTTATTTATTTATTTATT 



9172 AAGGATATTTTACACAAAGTTCTATTGTCTGATTCCTTAGCAGTTGTTACTACTGTTTCC 
CTAACCTCTAATCTTCTATTGGGTTATTAGTCTTAGAATTGAATTTTGAGAGGTAAGGGC 
TTGAATTTGAACATAGAAATTTATACAGGTCTGATCAGTAGTTCTTGACATTGTATTATC 
TGGAAACAAATCTTTAGAACTGAGCTTAAGATGTTTAATGACATTTTGTAGACAGAGTAT 
GATTTCAGTGTAGTTGTTTTTGTTTCTTTCTAGATCTAGTTCAGAGATGAAGTATATCAA 
[C,A] 

TTTTTTTTCCTTTTTGACCCAATGCTAGCAGAAAAACAACACCTTTTAATCATATTTAGT 
ATTTGAAAATGTGTATACAGGTTCCTTTTTATTTTATTTATTTCTTACAGGTTCCTTTTT 
AATCAGCTTTATTGAGATAGAGTTCATATACTGTATGGTTCATACCACATATGGTTCATA 
TACCATACAGTATATGAACTCACTTTAAGAGTATAATTCAGTGGGTTTAAGGGTATAATT 
CATTCATTTTAAGGGTATAATTCAGTGGCTTTTAGTATATTTTCTTTTTTTTCTTCTTTT 

9173 AGGATATTTTACACAAAGTTCTATTGTCTGATTCCTTAGCAGTTGTTACTACTGTTTCCC 
TAACCTCTAATCTTCTATTGGGTTATTAGTCTTAGAATTGAATTTTGAGAGGTAAGGGCT 
TGAATTTGAACATAGAAATTTATACAGGTCTGATCAGTAGTTCTTGACATTGTATTATCT 
GGAAACAAATCTTTAGAACTGAGCTTAAGATGTTTAATGACATTTTGTAGACAGAGTATG 
ATTTCAGTGTAGTTGTTTTTGTTTCTTTCTAGATCTAGTTCAGAGATGAAGTATATCAAC 



TTTTTTTCCTTTTTGACCCAATGCTAGCAGAAAAACAACACCTTTTAATCATATTTAGTA 
TTTGAAAATGTGTATACAGGTTCCTTTTTATTTTATTTATTTCTTACAGGTTCCTTTTTA 
ATCAGCTTTATTGAGATAGAGTTCATATACTGTATGGTTCATACCACATATGGTTCATAT 
ACCATACAGTATATGAACTCACTTTAAGAGTATAATTCAGTGGGTTTAAGGGTATAATTC 
ATTCATTTTAAGGGTATAATTCAGTGGCTTTTAGTATATTTTCTTTTTTTTCTTCTTTTT 

9180 TTTACACAAAGTTCTATTGTCTGATTCCTTAGCAGTTGTTACTACTGTTTCCCTAACCTC 
TAATCTTCTATTGGGTTATTAGTCTTAGAATTGAATTTTGAGAGGTAAGGGCTTGAATTT 
GAACATAGAAATTTATACAGGTCTGATCAGTAGTTCTTGACATTGTATTATCTGGAAACA 
AATCTTTAGAACTGAGCTTAAGATGTTTAATGACATTTTGTAGACAGAGTATGATTTCAG 
TGTAGTTGTTTTTGTTTCTTTCTAGATCTAGTTCAGAGATGAAGTATATCAACTTTTTTT 
[T,-] 

CCTTTTTGACCCAATGCTAGCAGAAAAACAACACCTTTTAATCATATTTAGTATTTGAAA 
ATGTGTATACAGGTTCCTTTTTATTTTATTTATTTCTTACAGGTTCCTTTTTAATCAGCT 
TTATTGAGATAGAGTTCATATACTGTATGGTTCATACCACATATGGTTCATATACCATAC 
AGTATATGAACTCACTTTAAGAGTATAATTCAGTGGGTTTAAGGGTATAATTCATTCATT 
TTAAGGGTATAATTCAGTGGCTTTTAGTATATTTTCTTTTTTTTCTTCTTTTTTTCTTTT 

13351 TTTTCCCCCATTGAATTATCTTGGCATCATTGTCAGAGATAAATTGACCGTAAATGTGAG 
GGTTTTATTTCTGAACTCTCAAGTCCATTTCATTGGTCTACATGTCCCTATGCCAGTAAT 
ACACTATCTTGGTTACTGTAGCTTTTTAGTACGTTTTGAAATGTTTTTAAAATTTGTTTT 
TCATCTAAATTTTAGGATTAATTTGTCAATTTCTGCACAAAAGGCACCTGGGTTTCTATA 
GGGGTTATGCAGAATCTGTAGATCAACTGGGGGAGTATTACAGGCATGAGCCACCGTGCC 
[T,C] 

GGCTGACTGAGTTTTTCATAGATGTACTCTATCAGGTTTAGGAAGTTCCCTTTTATTCCT 
AGGTTGTTGAGTCTATTTTATATTACTTTTTTAGAGACAGTCTTGCTCTGTCCCTCAGGC 
TGTAGCACAGTGGCTCAATCATAGCTCACTGCAGCCTTGAACTCCTAGGTTCAAGAGATC 
CGCCTGCCTCAGCCTTCTTAGTAGCTGGGATTACATGCATGCACCACCATACTGGGCTAA 
TTTTTTAAAATTTTTTATAGAGACAGGGTCTTATTACTATGTTGCCCAGACTGGCATTGA 

GTTGTGTACCCAGTTTCCAGAACTCTTCATCCTACAGAACTGAAACTCCATACCCATTAA 
ATGAGTCCCCATTCTCTTTCCCCCAGCTCATGGCAAACAGCATTCTATTTTCAGTCTCTA 
TGAATTTGATTAGTTTAGATACTTCATACTGTAAGTGGAATCATATGGTATTTGTCTTTT 
AGTGACTGCCTAATTTAAAAAAAATTTTTTTGAGACGGAGTCCTGCTCTGTCGCCCAGGC 
TGGAGTGCAGTGGCACCATCTCTGCTCACTGCAACCTCCACCTCCCAGGTTCAAGTGATT 

[G,C] 

TCCTGCCTCGGCCTCCCACGTAGCTGGGATTACAGGTGCTCGCCACAACACCCGGCTAAT 
TTTTGTATTTTTAGGTAGAGACTGGGTTTCACCATGTTGGCCAGGCTGGTCTCGAACTCC 
TGACCTCAAATTATCCACCTGCCTTGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCC 
ACTGTGCCCAGCCTCCATGTTGTTTTTCACAACACCTGTATCATTTACATTTCCACCAAC 
AGTACACAAGAATTTCAGTTTCTCCACATCCTTGCTAGCAGTTGTTATTATCTGTTTTTT 

TGGAATCATATGGTATTTGTCTTTTAGTGACTGCCTAATTTAAAAAAAATTTTTTTGAGA 
CGGAGTCCTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCACCATCTCTGCTCACTGCAAC 
CTCCACCTCCCAGGTTCAAGTGATTGTCCTGCCTCGGCCTCCCACGTAGCTGGGATTACA 
GGTGCTCGCCACAACACCCGGCTAATTTTTGTATTTTTAGGTAGAGACTGGGTTTCACCA 
TGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCAAATTATCCACCTGCCTTGGCCTCCCA 

[A,C] 

AGTGCTGGGATTACAGGCGTGAGCCACTGTGCCCAGCCTCCATGTTGTTTTTCACAACAC 
CTGTATCATTTACATTTCCACCAACAGTACACAAGAATTTCAGTTTCTCCACATCCTTGC 
TAGCAGTTGTTATTATCTGTTTTTTTTTAATGGTTTCTTTTTTCCTTTTTCTTTTTTTTT 
TTTTTTGAGACGGTCTTATTCTTGCTCATGCTGGAGTGCAGTGGTGCAATGTGATAGCTC 
ACTGCAGCCTCAACCTCTGGGCTCAAGCAGTCCTGCCACCTCAGCCTCCACATAGGTGGG 

GTAGTGGCTCATGCCTAAATCTCAGCACTTTGGGAGGCTGAGGCGGGAGGATCACCTGAG 
GTCGGGAGTTTGAGACCAGCCTGGGCAACATGGTGAAACCCCGTGTCTACTAAAAATATA 
AAAACTAGCCAGTCATGGTGGCACACACCTGTAATCCCGGCTACTTGGGAGGCTGAGGCA 
TGAGAATCACTTGAACCTGGGCTGCGGAGGTTGTAGTAAGCTGAGATCATGCCACTGTAC 
TCCAGCCTGGGTGACAGGGAGACATTCTCTCTCTCAAAAAAAAAAAAAAAACAAAAAAAA 

AAAACAAACCAACAAAACAAAGTAATCCAGGAACAACAACATGATGAAGGACTGCATGCA 
GGACTCAGTGATGGATGGTGGAAGACAGCCAGGAAGTTAAGCATGACTCTGGTATTAAGT 
GTTGTCTGGGAGAGTTAAGATTCCATTTACAGAAATAAGACCTGTAGGGGAAGCTCTTGA 
TTTTTTTTTTTTTGCAGACTGCTGATTTCCTGATTACATGTGTTAAGTTTGAGGTATAGA 
GAGAAAGAACATCCTGGCCGGGTGCAGTGGCTCACACCCGTAATCCCAGCACTTTGGGAG 

GTAGTGGCTCATGCCTAAATCTCAGCACTTTGGGAGGCTGAGGCGGGAGGATCACCTGAG 
GTCGGGAGTTTGAGACCAGCCTGGGCAACATGGTGAAACCCCGTGTCTACTAAAAATATA 
AAAACTAGCCAGTCATGGTGGCACACACCTGTAATCCCGGCTACTTGGGAGGCTGAGGCA 
TGAGAATCACTTGAACCTGGGCTGCGGAGGTTGTAGTAAGCTGAGATCATGCCACTGTAC 
TCCAGCCTGGGTGACAGGGAGACATTCTCTCTCTCAAAAAAAAAAAAAAAACAAAAAAAA 

[A,C] 

AAAACAAACCAACAAAACAAAGTAATCCAGGAACAACAACATGATGAAGGACTGCATGCA 
GGACTCAGTGATGGATGGTGGAAGACAGCCAGGAAGTTAAGCATGACTCTGGTATTAAGT 
GTTGTCTGGGAGAGTTAAGATTCCATTTACAGAAATAAGACCTGTAGGGGAAGCTCTTGA 
TTTTTTTTTTTTTGCAGACTGCTGATTTCCTGATTACATGTGTTAAGTTTGAGGTATAGA 
GAGAAAGAACATCCTGGCCGGGTGCAGTGGCTCACACCCGTAATCCCAGCACTTTGGGAG 

GTAGTGGCTCATGCCTAAATCTCAGCACTTTGGGAGGCTGAGGCGGGAGGATCACCTGAG 
GTCGGGAGTTTGAGACCAGCCTGGGCAACATGGTGAAACCCCGTGTCTACTAAAAATATA 
AAAACTAGCCAGTCATGGTGGCACACACCTGTAATCCCGGCTACTTGGGAGGCTGAGGCA 
TGAGAATCACTTGAACCTGGGCTGCGGAGGTTGTAGTAAGCTGAGATCATGCCACTGTAC 
TCCAGCCTGGGTGACAGGGAGACATTCTCTCTCTCAAAAAAAAAAAAAAAACAAAAAAAA 

[A,C] 

AAAACAAACCAACAAAACAAAGTAATCCAGGAACAACAACATGATGAAGGACTGCATGCA 
GGACTCAGTGATGGATGGTGGAAGACAGCCAGGAAGTTAAGCATGACTCTGGTATTAAGT 
GTTGTCTGGGAGAGTTAAGATTCCATTTACAGAAATAAGACCTGTAGGGGAAGCTCTTGA 
TTTTTTTTTTTTTGCAGACTGCTGATTTCCTGATTACATGTGTTAAGTTTGAGGTATAGA 
GAGAAAGAACATCCTGGCCGGGTGCAGTGGCTCACACCCGTAATCCCAGCACTTTGGGAG 



TAGCCATCATGGAATACTATGCACCCATTAAAAATAATGAAATAATGTCTTTGCAACAAC 
ATGAATGTAGCTGGAGGGCATTATCCTAAGCAAACTAACACAGAAACAGAAAACCAAATA 
CTGCGTGTTCTCACGCAGTGAGAGTGGGAGCTGAACATCAAGTACACATGGATGTAAAGA 
TGGCAACAATAGACATGGGTCTACTAGAGGTGGTGGTGCGGCAGGGTGGGGGTGGGGGTT 
GTGTGGCAGAGGAACAGCTGAAAAACTACCTATTTGATACTATACCCAGCACCTGGGAAA 
[C,T] 

GGGTTCAGTCATACCCCAAACCTCAGCATCACACAGTATACCTTTCTAACAAACTTACAC 
ATGTATTCTGTGATTCTAAAATAAACATTGAAAAATAAAAAAAAAACTGACATGGTTTGT 
ACTGTTTAATCTGACATAATGGCTAGGGGAAATGAAGTCTGCAGAATGGCTGTTTACGGA 
TGTTGTTGTTGTTGTTGAGATGAGGTCTCACTATGTTGCCCAGGCTTGAACTCCTGGCCT 
CAAGCAGTCCTCCTGCCTTGACCTCCCAAAATGTTGAGATTACATGCATGAGCCATTGCC 

GCATGAGCCATTGCCAAAACGGCTATTTGGATTGCTGTTAAGGTTATTACATTCTCTGTG 
TAGTAAGACCTTGAAGGAGAAGGATTTGAGATCAGGAGTTTAAGAAAAAATGTTAATCTA 
GGAAGAGAGGATAATTTCTGTGTTTGGCCAGTTGCAATGGCTCACGCCTGTAATCCCAGC 
GTTTTGGGAGGCCGAGCTGGGCAGATCACTTGAGCTCAGGAGTTTGAGAAGAGCCTGGGC 
AACATGGTGAAGCCCCGTCTCTACTGAAAATACAAAAATTGGCTGGGCGTGGTGGCAGGT 
[G,1] 

CCCACTGTAATTTCAGCCACTCAGGAGACTGAGGCAGGAGAATTGCTTGAACCTGGAAAG 
TGGAGGATATGGTGAGCCTAGATCGCGCCACTCTACTCCAGCCTGGGAGACTCCATCTCA 
AAAAAAAAAGAAAACAAGATGCTGAAATGAAGTAATTACCACAGTCAATGTGATCCTATA 
ACTTTGTTTTCTTTTAGAGATGGGGTCTCCCTCTGTCACCCAGGCTGGAGTGCAGTGGTG 
CATCATAGCTTTCTGCAGCCTCCACCTCCTGGGCTCAGGTGGTCCTCTTGCCTCAGTGTT 

AGGAGTTTAAGAAAAAATGTTAATCTAGGAAGAGAGGATAATTTCTGTGTTTGGCCAGTT 
GCAATGGCTCACGCCTGTAATCCCAGCGTTTTGGGAGGCCGAGCTGGGCAGATCACTTGA 
GCTCAGGAGTTTGAGAAGAGCCTGGGCAACATGGTGAAGCCCCGTCTCTACTGAAAATAC 
AAAAATTGGCTGGGCGTGGTGGCAGGTGCCCACTGTAATTTCAGCCACTCAGGAGACTGA 
GGCAGGAGAATTGCTTGAACCTGGAAAGTGGAGGATATGGTGAGCCTAGATCGCGCCACT 
[C,G] 

TACTCCAGCCTGGGAGACTCCATCTCAAAAAAAAAAGAAAACAAGATGCTGAAATGAAGT 
AATTACCACAGTCAATGTGATCCTATAACTTTGTTTTCTTTTAGAGATGGGGTCTCCCTC 
TGTCACCCAGGCTGGAGTGCAGTGGTGCATCATAGCTTTCTGCAGCCTCCACCTCCTGGG 
CTCAGGTGGTCCTCTTGCCTCAGTGTTCCGAGTAGTTAGGACTGACTGCAGGTGCATGCT 
GCTATGCCTGGCTAACTTTAAAATTTTTTTGTAGAGGCGGGGTCTTGCTATGTTGCCTAG 

GGAGTTTAAGAAAAAATGTTAATCTAGGAAGAGAGGATAATTTCTGTGTTTGGCCAGTTG 
CAATGGCTCACGCCTGTAATCCCAGCGTTTTGGGAGGCCGAGCTGGGCAGATCACTTGAG 
CTCAGGAGTTTGAGAAGAGCCTGGGCAACATGGTGAAGCCCCGTCTCTACTGAAAATACA 
AAAATTGGCTGGGCGTGGTGGCAGGTGCCCACTGTAATTTCAGCCACTCAGGAGACTGAG 
GCAGGAGAATTGCTTGAACCTGGAAAGTGGAGGATATGGTGAGCCTAGATCGCGCCACTC 
[T,C] 

ACTCCAGCCTGGGAGACTCCATCTCAAAAAAAAAAGAAAACAAGATGCTGAAATGAAGTA 
ATTACCACAGTCAATGTGATCCTATAACTTTGTTTTCTTTTAGAGATGGGGTCTCCCTCT 
GTCACCCAGGCTGGAGTGCAGTGGTGCATCATAGCTTTCTGCAGCCTCCACCTCCTGGGC 
TCAGGTGGTCCTCTTGCCTCAGTGTTCCGAGTAGTTAGGACTGACTGCAGGTGCATGCTG 
CTATGCCTGGCTAACTTTAAAATTTTTTTGTAGAGGCGGGGTCTTGCTATGTTGCCTAGG 

GGAAGAGAGGATAATTTCTGTGTTTGGCCAGTTGCAATGGCTCACGCCTGTAATCCCAGC 
GTTTTGGGAGGCCGAGCTGGGCAGATCACTTGAGCTCAGGAGTTTGAGAAGAGCCTGGGC 
AACATGGTGAAGCCCCGTCTCTACTGAAAATACAAAAATTGGCTGGGCGTGGTGGCAGGT 
GCCCACTGTAATTTCAGCCACTCAGGAGACTGAGGCAGGAGAATTGCTTGAACCTGGAAA 
GTGGAGGATATGGTGAGCCTAGATCGCGCCACTCTACTCCAGCCTGGGAGACTCCATCTC 
[-,A] 

AAAAAAAAAGAAAACAAGATGCTGAAATGAAGTAATTACCACAGTCAATGTGATCCTATA 
ACTTTGTTTTCTTTTAGAGATGGGGTCTCCCTCTGTCACCCAGGCTGGAGTGCAGTGGTG 
CATCATAGCTTTCTGCAGCCTCCACCTCCTGGGCTCAGGTGGTCCTCTTGCCTCAGTGTT 
CCGAGTAGTTAGGACTGACTGCAGGTGCATGCTGCTATGCCTGGCTAACTTTAAAATTTT 
TTTGTAGAGGCGGGGTCTTGCTATGTTGCCTAGGCTGGTCTCCAACTCCTGATCTCAATC 

CTGTAAGCATTGTATATACATTATCTTTTGTCATTGTGATGGGGTCTTCCTGGTTCCTGC 




TAGTATTTATGTGCTTTTTTTTCCCCTCAAGACTGGAGCAGTTATTAGCCCCAATAGCCA 
ATCATTAAGCCTAAATCCTAATTCACAGTAGCATTGTGGGCTTCCTGGATCCTCAGCCAG 
AATAGGGTTTTTACAACTTAACAATAAAAAATGAGACGTCAGAGGGGAAGTATAGTAACT 
AGTGTTGTTTTGATTAAGAAGGGGATGAAACACAAAAACCAAAAGAAGTCTGTGGAGGAG 
[G,A] 

AGGAGCTAGGGCATGTTCTTCTGAGACTTGAGCGAGAGGAACCTTGGGAGTGGGAGGTTG 
TGGGGAAGTTAGAGGCTGCAAGGGCTGTTGAGGTAGTGAGAGGGACGGATCCCATGAGGA 
GTCTGGCATGGGGGCTCTGATTTAGCCTCTTCCCTGCAGTGGACAATGAGGAGGCTGCAC 
TGCTGCATGAAGAGGCTACCATGACTATTGAAGAGCTGCTGACACGCTACGGGCAGAACT 
GTCACAAGGGCCCTCCCCACAGCAAATCTGGAGGTGGGACAGGCGAGGAACCAGGGTCCC 



GENEWISE ALIGNMENT: 

gi|4505999|ref| 1 MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEDAHNCIPEL 

MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSME 
MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEVRRQGPIGW 

genomic 2134 aggtctccaagattggggggcccccctgttgacgtcgtaggaacgcagt 

tgcatcacactagcgagtgccgtctcagtcctagggtctatggagctgg 
gtccccgccgggccgcgcccgcgggccccccgacgcccgggggggcacg 

gi|4505999|ref| 50 DSETAMFSVYDGHGG EEVALYCA 

++ + G EEVALYCA 
PLRGGNLTEKESGGW -:G[ggg] EEVALYCA 

genomic 2281 cccggacagagaggtGGTCCTCC Intron 1 TAGGGggggtttg 

ctgggatcaaagggg <1 [2327 : 25451] -1> aatctagc 

gggcgtgggagcgag gatcgctc 

gi|4505999|ref| 73 KYLPDIIKDQKAYKEGKLQK ALEDAF 

KYLPDIIKDQKAYKEGKLQK ALEDAF 
KYLPDIIKDQKAYKEGKLQK ALEDAF 

genomic 25478 atccgaaagcagtaggaccaGTCTGTC Intron 2 CAGgtgggt 

aatcattaaaacaaagataa<0 [25538:25851] -0>ctaact 

attttccatggccgacgagg taatcc 

gi|4505999|ref| 99 LAIDAKLTTEEVIKELAQIAGRPTEDEDEKEKVADEDD 

LAIDAKLTTEEVIKELAQIAGRPTEDEDEKEKVADEDD 
LAIDAKLTTEEVIKELAQIAGRPTEDEDEKEKVADEDD 

genomic 25870 tgaggataagggaagcgcaggccagggggagagggggg 

tctacatccaattaatcatcggccaaaaaaaatcaaaa 
gttccagctaactaggagtagactgtataaaaattatt 

gi|4505999|ref| 137 DNEEAALLHEEATMTIEELLTRY 

DNEEAALLHEEATMTIEELLTRY 
V:V[gtg] DNEEAALLHEEATMTIEELLTRY 
genomic 25984 GGTGAGTG Intron 3 CAGTGgaggggcccgggaaaaggccact 

<1 [25985:26642] -1> aaaaccttaaacctctaattcga 

ctggtaggtagtcgttagggacc 

gi|4505999|ref| 161 GQNCHKGPPHSKSGGGTGEEPGSQGLNGEAGPEDSTRETPSQENGPTAK 

GQNCHKGPPHSKSGGGTGEEPGSQGLNGEAGPEDSTRETPSQENGPTAK 
GQNCHKGPPHSKSGGGTGEEPGSQGLNGEAGPEDSTRETPSQENGPTAK 

genomic 26714 gcatcagcccaatgggagggcgtcgcaggggcggtaagactcgagcaga 

gaagaagccagacgggcgaacgcagtagacgcaaccgacccaaagccca 
ggctcgctcccatatgacgaagcgcctggaatgcatgattaaatccacg 

gi|4505999|ref| 210 AYTGFSSNSERGTEAGQVGEPGIPTGEAGPSCSSASDKLPRVAKSKFFE 

AYTGFSSNSERGTEAGQVGEPGIPTGEAGPSCSSASDKLPRVAKSKFFE 
AYTGFSSNSERGTEAGQVGEPGIPTGEAGPSCSSASDKLPRVAKSKFFE 

genomic 26861 gtagtttatgcgagggcgggcgacaggggcttttgtgacccggatattg 

cacgtccacaggcacgatgacgtccgacgccgccccaatcgtcacatta 
ccactcccgatgtgacattgtctcttgtgtcctactcggtattgcgctg 

gi|4505999|ref| 259 DSEDESDEAEEEEEDSE ECSEEEDGY 
DSEDESDEAEEEEEDSE ECSEEEDGY 
DSEDESDEAEEEEEDSE ECSEEEDGY 
genomic 27008 gagggtgggggggggagGTAAGGG Intron 4 CAGgtagggggt 

agaaacaacaaaaaaga<0 [27059: 27638] -0>aggaaaaga 

ctgtgatgggaagactg accgagtcc 

gi|4505999|ref| 285 SSEEAENEEDEDDTEEAEEDDEEEEEEMMVPGMEGKEE 

SSEEAENEEDEDDTEEAEEDDEEEEEEMMVPGMEGKEE 
SSEEAENEEDEDDTEEAEEDDEEEEEEMMVPGMEGKEE 

genomic 27666 aaggggaggggggagggggggggggggaagcgaggagg 

ggaacaaaaaaaacaacaaaaaaaaaatttcgtagaaa 
ctggagtgatgtccggtagctaaaaaggggaggacagg 

gi|4505999|ref| 323 PGSDSGTTAVVALIRGKQLIVANAGD 

PGSDSGTTAVVALIRGKQLIVANAGD 
PGSDSGTTAVVALIRGKQLIVANAGD 

genomic 27780 GTGTGTG Intron 5 AAGcgtgagaaggggcacgactaggaggg 

<0 [27780:28130] -0>cgcaggcccttcttggaatttcacga 

tctcttaagggcgaaggggtaccaac 

gi|4505999|ref| 349 SRCVVSEAGKALDMSYDHKPEDEVELARIKNAGGKVTMDGRVNGGLNLS 

SRCVVSEAGKALDMSYDHKPEDEVELARIKNAGGKVTMDGRVNGGLNLS 
SRCVVSEAGKALDMSYDHKPEDEVELARIKNAGGKVTMDGRVNGGLNLS 

genomic 28209 tctggtgggagtgattgcacgggggcgcaaagggagaaggcgaggcact 

cggttcacgactatcaaaacaaatatcgtaacggatctaggtaggtatc 
tctgatgtcatacgcttcaagtaaaaaccgtttcgccgtgaccgccccc 

gi|4505999|ref| 398 RAI DHFYKRNKNLPPEEQMISAL

RAI DHFYKRNKNLPPEEQMISAL 
RAI G:G[ggg] DHFYKRNKNLPPEEQMISAL

genomic 28356 agaGGTAAGGG Intron 6 CAGGGgcttaaaaacccggcaatgc 

gct <1 [28366:29164]-1> aataagaaatccaaattcct

act ccctgacgcgatgaggtact 

gi|4505999|ref| 422 PDIKVLTLTDDHEFMVIACDGI N

PDIKVLTLTDDHEFMVIACDGI N
PDIKVLTLTDDHEFMVIACDGI W:W[tgg] N 

genomic 29227 cgaagcacaggcgtagagtggaTGGTGAGCA Intron 7 CAGGa 

catattctcaaaattttcgagt <2 [29295 : 29541] -2> a 

tccgggtctcctacgctcttcc t 

gi|4505999|ref| 446 VMSSQEVVDFIQSKISQRDENGELRLLSSIVEEL

VMSSQEVVDFIQSKISQRDENGELRLLSSIVEE+
VMSSQEVVDFIQSKISQRDENGELRLLSSIVEEV

genomic 29546 gaaacggggtactaaaccggaggccttttagggg

ttggaattattacatgagaaagatgttccttaat 
ggccgatatctaagccgttatggtgagactgagg 



cDNA



genomic 



1 MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEDAHNCIPEL 
MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSME 
MGAYLSQPNTVKCSGDGVGAPRLPLPYGFSAMQGWRVSMEVRRQGPIGW 
2134 aggtctccaagattggggggcccccctgttgacgtcgtaggaacgcagt 
tgcatcacactagcgagtgccgtctcagtcctagggtctatggagctgg 
gtccccgccgggccgcgcccgcgggccccccgacgcccgggggggcacg 



cDNA 



genomic 



50 DSETAMFSVYDGHGG EEVALYCA 
++ + G EEVALYCA 
PLRGGNLTEKESGGW -:G[ggg] EEVALYCA 

2281 cccggacagagaggtGGTCCTCC Intron 1 TAGGGggggtttg 

ctgggatcaaagggg <1 [2327 :25451]-1> aatctagc 

gggcgtgggagcgag gatcgctc 
cDNA



genomic 



cDNA 



genomic 



73 KYLPDIIKDQKAYKEGKLQK ALEDAF
KYLPDIIKDQKAYKEGKLQK ALEDAF
KYLPDIIKDQKAYKEGKLQK ALEDAF
25478 atccgaaagcagtaggaccaGTCTGTC Intron 2 CAGgtgggt
aatcattaaaacaaagataa<0 [25538:25851]-0>ctaact
attttccatggccgacgagg taatcc



99 LAIDAKLTTEEVIKELAQIAGRPTEDEDEKEKVADEDD
LAIDAKLTTEEVIKELAQIAGRPTEDEDEKEKVADEDD
LAIDAKLTTEEVIKELAQIAGRPTEDEDEKEKVADEDD
25870 tgaggataagggaagcgcaggccagggggagagggggg
tctacatccaattaatcatcggccaaaaaaaatcaaaa 
gttccagctaactaggagtagactgtataaaaattatt 



cDNA 



genomic 



137 DHFYKRNKNLPPEEQMISALPDI 

DHFYKRNKNLPPEEQMISALPDI
G:G[ggg] DHFYKRNKNLPPEEQMISALPDI

25934 GGTGAGTG Intron 3 CAGGGgcttaaaaacccggcaatgccga 

<1 [25985:29164]-1> aataagaaatccaaattcctcat 

ccctgacgcgatgaggtacttcc 






cDNA 



genomic 



cDNA 



genomic 



161 KVLTLTDDHEFMVIACDGI NVMS 
KVLTLTDDHEFMVIACDGI NVMS
KVLTLTDDHEFMVIACDGI W:W[tgg] NVMS 

29236 agcacaggcgtagagtggaTGGTGAGCA Intron 4 CAGGagaa 

attctcaaaattttcgagt <2 [29295 : 29541 ] -2> attg 

gggtctcctacgctcttcc tggc 

185 SQEVVDFIQSKISQRDENGELRLLSSIVEEL
SQEVVDFIQSKISQRDENGELRLLSSIVEE+ 
SQEVVDFIQSKISQRDENGELRLLSSIVEEV 
29555 acggggtactaaaccggaggccttttagggg 
gaattattacatgagaaagatgttccttaat 
cgatatctaagccgttatggtgagactgagg 